Open a file to start editing
Output
ollama serve locally. Default model: qwen2.5-coder:7b. No API key needed.
GeminiPaste your API key in ⚙ Settings. Uses gemini-2.0-flash by default.
wllamaRecommended. Pick any .gguf — cached in IndexedDB after first load. Real token streaming. Works with LM Studio downloads.
transformers.jsDownloads ONNX model from HuggingFace (~300MB). No file needed, but slower and heavier than GGUF.
Inject fileCheckbox in LLM panel sends the open file as context with every message.
.gguf — from LM Studio, llama.cpp, or HuggingFace.HuggingFaceTB/SmolLM2-360M-InstructQwen/Qwen2.5-Coder-0.5B-Instruct