
Conversation

@mvanders commented Aug 6, 2025

WHAT:

  • Add OllamaClient implementation for local LLM support
  • Add production-ready Docker compose configuration
  • Add requirements file for Ollama dependencies
  • Add comprehensive integration documentation
  • Add example FastAPI deployment

WHY:

  • Eliminates OpenAI API dependency and costs
  • Enables fully local/private processing
  • Resolves Docker health check race conditions
  • Fixes function signature corruption issues

TESTING:

  • Production tested with 1,700+ items from ZepCloud
  • 44 users, 81 threads, 1,638 messages processed
  • 48+ hours continuous operation
  • 100% success rate (vs <30% with MCP integration)

TECHNICAL DETAILS:

  • Model: qwen2.5:7b (also tested llama2, mistral)
  • Response time: ~200ms average
  • Memory usage: Stable at ~150MB
  • Docker: Removed problematic health checks
  • Group ID: Fixed validation (ika-production format)

This contribution provides a complete, production-tested alternative to the OpenAI dependency, allowing organizations to run Graphiti with full data privacy and zero API costs.
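
To make the approach concrete, here is a minimal sketch of an Ollama-backed client, assuming Ollama's default HTTP API on localhost:11434 and the httpx library; the class and method names echo this PR's description, but the bodies are illustrative assumptions, not the actual ollama_client.py:

```python
# Hypothetical sketch of an Ollama-backed client; not the PR's actual code.
# Assumes a local Ollama server on the default port with qwen2.5:7b pulled.
import httpx

class OllamaClient:
    def __init__(self, model: str = "qwen2.5:7b",
                 base_url: str = "http://localhost:11434"):
        self.model = model
        self.base_url = base_url

    async def generate_response(self, prompt: str) -> str:
        # /api/generate returns the full completion when stream is False
        async with httpx.AsyncClient(timeout=120.0) as client:
            resp = await client.post(
                f"{self.base_url}/api/generate",
                json={"model": self.model, "prompt": prompt, "stream": False},
            )
            resp.raise_for_status()
            return resp.json()["response"]

    async def generate_embedding(self, text: str) -> list[float]:
        # /api/embeddings returns one embedding vector for the input text
        async with httpx.AsyncClient(timeout=60.0) as client:
            resp = await client.post(
                f"{self.base_url}/api/embeddings",
                json={"model": self.model, "prompt": text},
            )
            resp.raise_for_status()
            return resp.json()["embedding"]
```

The model has to be pulled once beforehand (ollama pull qwen2.5:7b), and an extract_entities method would presumably layer a structured prompt on top of generate_response.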

Resolves common issues:

  • OpenAI API rate limiting
  • Docker container startup failures
  • Function parameter type mismatches
  • MCP integration complexity

Summary

Brief description of the changes in this PR.

Type of Change

  • Bug fix
  • New feature
  • Performance improvement
  • Documentation/Tests

Objective

For new features and performance improvements: Clearly describe the objective and rationale for this change.

Testing

  • Unit tests added/updated
  • Integration tests added/updated
  • All existing tests pass

Breaking Changes

  • This PR contains breaking changes

If this is a breaking change, describe:

  • What functionality is affected
  • Migration path for existing users

Checklist

  • Code follows project style guidelines (make lint passes)
  • Self-review completed
  • Documentation updated where necessary
  • No secrets or sensitive information committed

Related Issues

Closes #[issue number]


Important

Add Ollama integration for local LLM processing with production Docker setup and FastAPI deployment example.

  • Integration:
    • Add OllamaClient in ollama_client.py for local LLM processing, replacing OpenAI.
    • Supports generate_response, extract_entities, and generate_embedding methods.
  • Docker Setup:
    • Add docker-compose-production.yml for production-ready deployment.
    • Removes problematic health checks in favor of a fixed startup delay (see the compose sketch after this list).
  • FastAPI Deployment:
    • Add graphiti_api.py as an example FastAPI server.
    • Implements endpoints: /, /health, /status, /add_memory, /search (a minimal sketch follows this list).
  • Documentation:
    • Add OLLAMA_INTEGRATION.md for setup and benefits of Ollama integration.
  • Dependencies:
    • Add requirements-ollama.txt for Ollama-specific dependencies.
  • Testing:
    • Production tested with 1,700+ items, 100% success rate over 48+ hours.
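
As referenced above, the health-check workaround might look roughly like the fragment below; service names, image tags, and the delay value are assumptions for illustration, not the PR's actual docker-compose-production.yml:

```yaml
# Illustrative fragment only; service names, image tags, and the delay
# value are assumptions, not the PR's actual docker-compose-production.yml.
services:
  ollama:
    image: ollama/ollama
    ports:
      - "11434:11434"
    # No healthcheck block: the check raced the model load at startup,
    # so the dependent service waits a fixed interval instead.
  graphiti:
    build: .
    depends_on:
      - ollama
    command: sh -c "sleep 15 && python graphiti_api.py"
```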
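Likewise, a minimal sketch of the kind of FastAPI server described above; the endpoint names come from the summary, while the handler bodies are placeholders rather than the PR's graphiti_api.py:

```python
# Minimal illustrative FastAPI server; only a subset of the listed
# endpoints is sketched, and handler bodies are placeholders rather
# than the PR's actual graphiti_api.py.
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class Memory(BaseModel):
    group_id: str
    content: str

@app.get("/health")
async def health():
    return {"status": "ok"}

@app.post("/add_memory")
async def add_memory(memory: Memory):
    # A real server would hand the payload to Graphiti via the Ollama client.
    return {"accepted": True, "group_id": memory.group_id}

@app.get("/search")
async def search(q: str):
    # A real server would query the graph; this just echoes the query.
    return {"query": q, "results": []}
```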

This description was created by Ellipsis for 36a4211.

@danielchalef (Member) commented Aug 6, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@mvanders (Author) commented Aug 6, 2025

I have read the CLA Document and I hereby sign the CLA

@mvanders closed this Aug 6, 2025
@getzep locked and limited conversation to collaborators Aug 6, 2025
@mvanders reopened this Aug 6, 2025
@ellipsis-dev bot (Contributor) left a comment


Caution

Changes requested ❌

Reviewed everything up to 36a4211 in 2 minutes and 2 seconds.
  • Reviewed 504 lines of code in 5 files
  • Skipped 0 files when reviewing.
  • Skipped posting 5 draft comments. View those below.
1. OLLAMA_INTEGRATION.md:47
  • Draft comment:
    Verify the 'Tested by' date; 'August 2025' may appear future-dated.
  • Reason this comment was not posted:
    Confidence changes required: 66% <= threshold 80%
2. docker-compose-production.yml:60
  • Draft comment:
    End the file with a newline to adhere to best practices.
  • Reason this comment was not posted:
    Confidence changes required: 66% <= threshold 80%
3. graphiti_core/llm_client/ollama_client.py:171
  • Draft comment:
    Avoid using print for error logging; consider using a logger to record errors (see the sketch after this list).
  • Reason this comment was not posted:
    Confidence changes required: 66% <= threshold 80%
4. graphiti_core/llm_client/ollama_client.py:143
  • Draft comment:
    Avoid using print for error logging; use a proper logging mechanism.
  • Reason this comment was not posted:
    Confidence changes required: 66% <= threshold 80%
5. requirements-ollama.txt:14
  • Draft comment:
    Consider removing 'asyncio' from dependencies since it's part of the Python standard library.
  • Reason this comment was not posted:
    Confidence changes required: 66% <= threshold 80%
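
For draft comments 3 and 4, the suggested fix is standard-library logging in place of print; a minimal sketch, with a hypothetical function name:

```python
# Sketch of the logger-based error handling the comments suggest; the
# surrounding function is hypothetical, not the PR's actual code.
import logging

logger = logging.getLogger(__name__)

def call_ollama_safely(request_fn):
    try:
        return request_fn()
    except Exception:
        # logger.exception records the message plus the full traceback
        logger.exception("Ollama request failed")
        return None
```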

Workflow ID: wflow_wol7wDd4w1nzfL4T

