Our website uses necessary cookies to enable basic functions and optional cookies to help us to enhance your user experience. Learn more about our cookie policy by clicking "Learn More".
Accept All Only Necessary Cookies
LLM tester with llama.cpp icon

LLM tester with llama.cpp


Mick Lab - Mitsuo Kuroda
1.400
  • May 5, 2026
    Update date
  • Security
  • Everyone
  • Android 7.0+
    Android OS

About LLM tester with llama.cpp

Run local GGUF LLMs with custom settings and an optional Ollama‑compatible API.

1. What This App Can Do

This app lets you run large language models (LLMs) entirely on your Android device, enabling private text generation without external servers.

You can load GGUF models from HuggingFace or from local storage, providing a flexible offline AI environment.

The app supports a wide range of llama.cpp‑compatible models, including Gemma‑4 and Bonsai.

An Ollama‑compatible API server is also included, allowing other apps or scripts to access your local LLM via standard HTTP endpoints.

Web UI is Available. Vision as multimodel is Available with Gemma-4.

Upload mmproj file from local device. Only one mmproj model is acceptable at same time.

MCP and Function Calling are available.

---

2. Intended Users and Supported Devices

Ideal for:

- Users who want fully local LLMs

- Those using GGUF models from HuggingFace or local files

- Advanced users needing detailed parameter control

- Developers calling a local LLM from their apps

- Privacy‑focused users

The app works on a wide range of Android devices. By adjusting context size, threads, and batch size, you can tune performance for your hardware.

---

3. Key Features

- Load GGUF models from HuggingFace or local storage

- Fully offline inference

- Supports Gemma‑4 and other llama.cpp‑compatible models

- Detailed parameter settings (Mirostat, DRY, XTC, etc.)

- Ollama‑compatible API: /api/chat, /api/generate, /api/tags

- Automatic prompt template selection

- Streaming output option

- Comprehensive logs and UI safeguards

- Various minor improvements for stability and usability

- Web UI is Available.

- Vision as multimodel is Available with Gemma-4.

---

4. Getting Started

1. Open Settings.

2. Enter a HuggingFace GGUF URL or choose a local GGUF file, then tap Load Model.

3. Adjust parameters and tap Save Config.

4. Tap SAVE & CLOSE to apply settings.

---

5. Main Screen Functions

- Enter Prompt: Input your instruction

- Send: Start generation

- Re‑init Model: Reload current model

- View Log / Clear Log

- Start/Stop API Server

- Copy output or logs

- View timestamped processing logs

---

6. Settings Screen Highlights

- Save/load/delete configurations

- Model selection from URL or local storage

- Context size, temperature, Mirostat, DRY, XTC, etc.

- Streaming output toggle

- Custom or auto‑selected prompt templates

- API server port settings

- Log verbosity options

- Manual and privacy policy

---

7. Prompt Templates and Stop Sequences

The app detects the model family from the filename and selects an appropriate template.

It also stops generation when common delimiters appear to prevent runaway output.

Tips:

Gemma‑4 tends to repeat short phrases. Adding explicit anti‑repetition instructions in the system prompt or using stricter stop sequences can improve output quality.

---

8. API Server Capabilities

Provides:

- /api/chat

- /api/generate

- /api/tags

- /v1/chat/completions,

- /v1/models

- /props, /slots

- / for Web UI http://localhost:11434/

Only one generation request is processed at a time. Android 13+ may require notification permission.

---

9. How This App Stands Out

- GGUF loading from HuggingFace and local device

- Support for Gemma‑4 as multimodel via Web UI

- More detailed parameter control than typical local LLM apps

- Built‑in Ollama‑compatible API server

- Automatic template selection

- Flexible performance tuning

Show More

Additional APP Information

Latest Version

1.400

Uploaded by

Emma Diaz

Requires Android

Android 7.0+

License Required

Content Rating

Everyone

Security Report

What's New in the Latest Version 1.400

Last updated on May 5, 2026

MCP and Function Calling are Available.
Improve Settings usability.

Show More
Security Check Completed
No Virus
No Spyware
No Malware
No Virus
No Spyware
No Malware
Package Name:com.micklab.llama
SHA-256:4973b109268a125819c97d03d375b7b695ac49652a4537f7e5ae81c75a0c7f57
SHA-1:eb161900739fa8d5b520be57fc1d7b7b7b15300b
Show More
Verified by: APKPURE

LLM tester with llama.cpp Screenshots

LLM tester with llama.cpp posterLLM tester with llama.cpp screenshot 1LLM tester with llama.cpp screenshot 2LLM tester with llama.cpp screenshot 3LLM tester with llama.cpp screenshot 4

Related Tags

Popular Articles In Last 24 Hours

Subscribe to APKPure
Be the first to get access to the early release, news, and guides of the best Android games and apps.
No thanks
Sign Up
Subscribed Successfully!
You're now subscribed to APKPure.