wllama
AI/MLShare
AI Summary
wllama is a WebAssembly binding for llama.cpp that enables large language model inference directly within a web browser, eliminating the need for server-side processing. It is designed for developers and researchers who want to run LLMs client-side for privacy, offline use, or low-latency applications. Its interest lies in democratizing AI by making powerful models accessible through a standard web interface without requiring specialized hardware or cloud dependencies.
Cross-platform signals
GH
ViewGitHub
1.1k
stars
105
forks
Updated 2026-07-05
You might also like
More in AI/ML
odysseus
Self-hosted AI workspace.
80.7k
ponytail
Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.
74k
DeepSeek-Reasonix
DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.
26k
nature-skills
25.9k