TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
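Model quantization is the common thread across the projects below: storing float32 weights as int8 shrinks a model roughly 4x and speeds up CPU inference on constrained hardware. A minimal sketch using PyTorch's dynamic quantization API (the toy model and layer choices here are illustrative, not taken from any listed repo):

```python
import torch
import torch.nn as nn

# Toy float32 model standing in for a real network (illustrative only).
model = nn.Sequential(
    nn.Linear(128, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
)
model.eval()

# Dynamic quantization: weights are stored as int8 and activations are
# quantized on the fly at inference time. Works well for Linear/LSTM
# layers on CPU targets such as phones and single-board computers.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 128)
with torch.no_grad():
    print(quantized(x).shape)  # torch.Size([1, 10])
```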
iOS and Android app that runs local LLMs on-device, plus routstr cloud LLMs for anonymous inference
Flutter starter app for NobodyWho, a library designed to run LLMs locally and efficiently on any device.
Mobile AI: iOS CoreML, Android TFLite, on-device inference, ONNX, TensorRT, and ML deployment for smartphones.
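For the Android TFLite path mentioned above, on-device inference follows the same pattern regardless of model: load a .tflite flatbuffer, allocate tensors, copy input, invoke, read output. A hedged Python sketch (the model path and input are placeholders):

```python
import numpy as np
import tensorflow as tf

# Load a converted .tflite model (path is a placeholder).
interpreter = tf.lite.Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Fabricate an input matching the model's expected shape and dtype.
shape = input_details[0]["shape"]
x = np.random.random_sample(shape).astype(input_details[0]["dtype"])

interpreter.set_tensor(input_details[0]["index"], x)
interpreter.invoke()
y = interpreter.get_tensor(output_details[0]["index"])
print(y.shape)
```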
Production Android AI with ExecuTorch 1.0 - Deploy PyTorch models to mobile with NPU acceleration and 50KB footprint
Custom llama.cpp fork with a character intelligence engine: control vectors, attention bias, head rescaling, attention temperature, and fast weight memory
High-performance Android SDK for on-device LLM inference (GGUF). Privacy-focused, offline-first, and powered by llama.cpp with a clean Kotlin Coroutines API.
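The SDK above exposes llama.cpp through Kotlin; the equivalent offline GGUF flow in Python, via the llama-cpp-python bindings, is only a few lines (the model path and generation parameters are placeholders):

```python
from llama_cpp import Llama

# Load a GGUF-quantized model from local storage (path is a placeholder).
llm = Llama(model_path="models/phi-3-mini-q4_k_m.gguf", n_ctx=2048)

# Everything below runs fully offline: prompt in, tokens out, no network.
result = llm(
    "Q: Name one benefit of on-device inference. A:",
    max_tokens=64,
    stop=["Q:"],
)
print(result["choices"][0]["text"])
```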
Neural acoustic echo cancellation for Apple platforms using CoreML — Swift package with 128/256/512-unit DTLN-aec models
Real-time SAM2 segmentation on edge devices - 40x faster C++ inference with ONNX Runtime for iOS/Android deployment
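The ONNX Runtime deployment pattern used by projects like the SAM2 port above looks the same in any language binding; a minimal Python sketch (the file name, provider, and shapes are assumptions, not taken from the repo):

```python
import numpy as np
import onnxruntime as ort

# Create an inference session; on mobile this would select the NNAPI or
# CoreML execution provider instead of the default CPU one.
session = ort.InferenceSession(
    "encoder.onnx", providers=["CPUExecutionProvider"]
)

input_name = session.get_inputs()[0].name
# Dummy image tensor; real SAM2 inputs are preprocessed video frames.
x = np.random.rand(1, 3, 1024, 1024).astype(np.float32)

outputs = session.run(None, {input_name: x})
print([o.shape for o in outputs])
```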
Ad generation via offline LLMs with on-device inference, optionally managed by a self-hosted CMS.
The Private Agent OS — search files, run AI agents, connect to 10,000+ tools via the complete protocol stack (MCP, AG-UI, A2UI, A2A). Zero cloud. Zero telemetry. On-device inference.
Swift wrapper for Apple's BNNS graph API — run compiled CoreML models (.mlmodelc) on CPU with zero-copy buffer management
Run small LLMs directly on your device, no cloud computing needed.
React Native SDK for local LLM inference and on-device AI on iOS and Android.
Open source Node.js runtime for local LLM inference, on-device AI, and private model execution.
Web JavaScript SDK for local LLM inference with WebGPU and on-device AI.
WebGPU runtime core for local LLM inference, on-device AI, and client-side model execution.
On-device inference engine for Apple silicon
Offline plant disease diagnosis system powered by MobileNetV3-Large and TensorFlow Lite — 38 disease classes, 14 crop species, ~5.58ms inference on-device. Built with Flutter & Python.
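Latency figures like the ~5.58 ms claim above are straightforward to reproduce for your own model: time repeated invoke() calls after a warm-up pass. A hedged sketch building on the TFLite pattern shown earlier (the model path is a placeholder):

```python
import time
import numpy as np
import tensorflow as tf

# Placeholder path; substitute your own converted .tflite model.
interpreter = tf.lite.Interpreter(model_path="plant_disease.tflite")
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
x = np.random.random_sample(inp["shape"]).astype(inp["dtype"])

# Warm-up pass so one-time allocation cost is excluded from the timing.
interpreter.set_tensor(inp["index"], x)
interpreter.invoke()

runs = 100
start = time.perf_counter()
for _ in range(runs):
    interpreter.set_tensor(inp["index"], x)
    interpreter.invoke()
elapsed_ms = (time.perf_counter() - start) * 1000 / runs
print(f"mean latency: {elapsed_ms:.2f} ms")
```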
Deep technical writing on edge AI, on-device inference, llama.cpp, GGML, and mobile AI engineering