An AI coding assistant built in Rust that provides both command-line and graphical interfaces for autonomous code analysis and modification.
Multi-Modal Tool Execution: Adapts to different LLM capabilities with pluggable tool invocation modes - native function calling, XML-style tags, and triple-caret blocks - ensuring compatibility across various AI providers.
Real-Time Streaming Interface: Advanced streaming processors parse and display tool invocations as they stream from the LLM, with smart filtering to prevent unsafe tool combinations.
Session-Based Project Management: Each chat session is tied to a specific project and maintains persistent state, working memory, and draft messages with attachment support.
Multiple Interface Options: Choose between a modern GUI built on Zed's GPUI framework, traditional terminal interface, or headless MCP server mode for integration with MCP clients such as Claude Desktop.
Intelligent Project Exploration: Autonomously builds understanding of codebases through working memory that tracks file structures, dependencies, and project context.
git clone https://github.com/stippi/code-assistant
cd code-assistant
cargo build --release
The binary will be available at target/release/code-assistant
.
Create ~/.config/code-assistant/projects.json
to define available projects:
{
"code-assistant": {
"path": "/Users/<username>/workspace/code-assistant"
},
"my-project": {
"path": "/Users/<username>/workspace/my-project"
}
}
Important Notes:
- When launching from a folder not in this configuration, a temporary project is created automatically
- The assistant has access to the current project (including temporary ones) plus all configured projects
- Each chat session is permanently associated with its initial project and folder - this cannot be changed later
- Tool syntax (native/xml/caret) is also fixed per session at creation time
- The LLM provider selected at startup is used for the entire application session (UI switching planned for future releases)
# Start with graphical interface
code-assistant --ui
# Start GUI with initial task
code-assistant --ui --task "Analyze the authentication system"
# Basic usage
code-assistant --task "Explain the purpose of this codebase"
# With specific provider and model
code-assistant --task "Add error handling" --provider openai --model gpt-5
code-assistant server
Claude Desktop Integration
Configure in Claude Desktop settings (Developer tab → Edit Config):
LLM Providers
Anthropic (default):
export ANTHROPIC_API_KEY="sk-ant-..."
code-assistant --provider anthropic --model claude-sonnet-4-20250514
OpenAI:
export OPENAI_API_KEY="sk-..."
code-assistant --provider openai --model gpt-4o
SAP AI Core:
Create ~/.config/code-assistant/ai-core.json
:
{
"auth": {
"client_id": "<service-key-client-id>",
"client_secret": "<service-key-client-secret>",
"token_url": "https://<your-url>/oauth/token",
"api_base_url": "https://<your-url>/v2/inference"
},
"models": {
"claude-sonnet-4": "<deployment-id>"
}
}
Ollama:
code-assistant --provider ollama --model llama2 --num-ctx 4096
Other providers: Vertex AI (Google), OpenRouter, Groq, MistralAI
Advanced Options
Tool Syntax Modes:
--tool-syntax native
: Use the provider's built-in tool calling (most reliable, but streaming of parameters depends on provider)--tool-syntax xml
: XML-style tags for streaming of parameters--tool-syntax caret
: Triple-caret blocks for token-efficency and streaming of parameters
Session Recording:
# Record session (Anthropic only)
code-assistant --record session.json --task "Optimize database queries"
# Playback session
code-assistant --playback session.json --fast-playback
Other Options:
--continue-task
: Resume from previous session state--use-diff-format
: Enable alternative diff format for file editing--verbose
: Enable detailed logging--base-url
: Custom API endpoint
The code-assistant features several innovative architectural decisions:
Adaptive Tool Syntax: Automatically generates different system prompts and streaming processors based on the target LLM's capabilities, allowing the same core logic to work across providers with varying function calling support.
Smart Tool Filtering: Real-time analysis of tool invocation patterns prevents logical errors like attempting to edit files before reading them, with the ability to truncate responses mid-stream when unsafe combinations are detected.
Multi-Threaded Streaming: Sophisticated async architecture that handles real-time parsing of tool invocations while maintaining responsive UI updates and proper state management across multiple chat sessions.
Contributions are welcome! The codebase demonstrates advanced patterns in async Rust, AI agent architecture, and cross-platform UI development.
This section is not really a roadmap, as the items are in no particular order. Below are some topics that are likely the next focus.
- Block Replacing in Changed Files: When streaming a tool use block, we already know the LLM attempts to use
replace_in_file
and we know in which file quite early. If we also know this file has changed since the LLM last read it, we can block the attempt with an appropriate error message. - Compact Tool Use Failures: When the LLM produces an invalid tool call, or a mismatching search block, we should be able to strip the failed attempt from the message history, saving tokens.
- Improve UI: There are various ways in which the UI can be improved.
- Add Memory Tools: Add tools that facilitate building up a knowledge base useful work working in a given project.
- Security: Ideally, the execution for all tools would run in some sort of sandbox that restricts access to the files in the project tracked by git.
Currently, the tools reject absolute paths, but do not check whether the relative paths point outside the project or try to access git-ignored files.
The
execute_command
tool runs a shell with the provided command line, which at the moment is completely unchecked. - Fuzzy matching search blocks: Investigate the benefit of fuzzy matching search blocks.
Currently, files are normalized (always
\n
line endings, no trailing white space). This increases the success rate of matching search blocks quite a bit, but certain ways of fuzzy matching might increase the success even more. Failed matches introduce quite a bit of inefficiency, since they almost always trigger the LLM to re-read a file. Even when the error output of thereplace_in_file
tool includes the complete file and tells the LLM not to re-read the file.