Contributing to Mellea

Prerequisites: Python 3.11+, uv installed, Ollama installed.

Contribution pathways

Three pathways exist for contributing to Mellea:

Core repository: bug fixes, standard library additions (Requirements, Components, Sampling Strategies), backend improvements, documentation, and tests. Follow the Pull request process below.
Applications and libraries: build tools or applications on top of Mellea in your own repository. Use the mellea- prefix for discoverability (e.g., github.com/my-company/mellea-legal-utils).
Community components: contribute experimental or specialized components to mellea-contribs. Open an issue first for general-purpose additions to decide whether they belong in the standard library or in mellea-contribs.

Development setup

Set up with uv (recommended)

Fork and clone the repository:

git clone ssh://git@github.com/<your-username>/mellea.git
cd mellea/

Create a virtual environment:

uv venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:

# Install all dependencies (recommended for development)
uv sync --all-extras --all-groups

# Or install only backend dependencies
uv sync --extra backends --all-groups

Install pre-commit hooks (required):
```bash
pre-commit install
```

Note: Python 3.13+ requires a Rust compiler for the outlines dependency. Use Python 3.12 if you prefer to avoid this.

Set up with conda or mamba

Fork and clone the repository:

git clone ssh://git@github.com/<your-username>/mellea.git
cd mellea/

Run the installation script:
```bash
conda/install.sh
```
The script handles environment setup, dependency installation, and pre-commit hook installation.

Verify the installation

# Start Ollama in a separate terminal; it keeps running in the foreground (required for most tests)
ollama serve

# Run fast tests (skip qualitative tests, ~2 min)
uv run pytest -m "not qualitative"

Coding standards

Type annotations

Type annotations are required on all core functions:

def process_text(text: str, max_length: int = 100) -> str:
    """Process text with maximum length."""
    return text[:max_length]
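
As a rough illustration of what "fully annotated" means, the sketch below lists the parameters and return values that lack annotations. `missing_annotations` and `untyped` are hypothetical names written for this example; they are not part of Mellea or its tooling, and in the repository mypy performs the actual check.

```python
import inspect
from collections.abc import Callable


def missing_annotations(func: Callable[..., object]) -> list[str]:
    """Return names of unannotated parameters, plus "return" if it is missing."""
    sig = inspect.signature(func)
    missing = [
        name
        for name, param in sig.parameters.items()
        if param.annotation is inspect.Parameter.empty
    ]
    if sig.return_annotation is inspect.Signature.empty:
        missing.append("return")
    return missing


def process_text(text: str, max_length: int = 100) -> str:
    """Process text with maximum length."""
    return text[:max_length]


def untyped(text, max_length=100):
    # Deliberately unannotated, to show what the check reports.
    return text[:max_length]


print(missing_annotations(process_text))  # []
print(missing_annotations(untyped))  # ['text', 'max_length', 'return']
```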

Docstrings

Docstrings serve as prompts — the LLM reads them, so be specific. Use Google-style docstrings:

def extract_entities(text: str, entity_types: list[str]) -> dict[str, list[str]]:
    """Extract named entities from text.

    Args:
        text: The input text to analyze.
        entity_types: List of entity types to extract (e.g., ["PERSON", "ORG"]).

    Returns:
        Dictionary mapping entity types to lists of extracted entities.

    Example:
        >>> extract_entities("Alice works at IBM", ["PERSON", "ORG"])
        {"PERSON": ["Alice"], "ORG": ["IBM"]}
    """
    ...
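
To show why specificity matters, here is a minimal sketch of one way a docstring could end up in prompt text. `build_prompt` and `classify_sentiment` are hypothetical and exist only for this illustration; this is not Mellea's implementation, which may assemble prompts quite differently.

```python
import inspect
from collections.abc import Callable


def build_prompt(func: Callable[..., object], **arguments: object) -> str:
    """Combine a function's signature, docstring, and arguments into prompt text."""
    doc = inspect.getdoc(func) or "(no docstring)"
    args = "\n".join(f"{name} = {value!r}" for name, value in arguments.items())
    header = f"Function: {func.__name__}{inspect.signature(func)}"
    return f"{header}\n\n{doc}\n\nArguments:\n{args}"


def classify_sentiment(text: str) -> str:
    """Classify the sentiment of the text.

    Returns:
        Exactly one of "positive", "negative", or "neutral".
    """
    ...


# Every word of the docstring is visible in the assembled text.
print(build_prompt(classify_sentiment, text="I love this library"))
```

In this sketch, a vague docstring such as "Process the text" would give the model nothing to go on, while the Returns section above states the allowed outputs outright.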

Code style

Use Ruff for linting and formatting.
Use ... in @generative function bodies.
Prefer primitives over classes.
Keep functions focused and single-purpose.

Linting and formatting

# Format code
uv run ruff format .

# Lint code
uv run ruff check .

# Fix auto-fixable issues
uv run ruff check --fix .

# Type check
uv run mypy .

Development workflow

Commit messages

Follow the Angular commit format:

<type>: <subject>

<body>

<footer>

Types: feat, fix, docs, test, refactor, release

Example:

feat: add support for streaming responses

Implements streaming for all backend types with proper
error handling and timeout management.

Closes #123

Always sign off commits with -s or --signoff:

git commit -s -m "feat: your commit message"

Branch naming: feat/topic, fix/issue-id, docs/topic
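
The subject-line format above can be checked mechanically. The sketch below is a hypothetical helper, not part of the repository's tooling. It assumes the six types listed in this guide are the complete set and that no scope such as `feat(core):` is used; the project's own checks may differ.

```python
import re

# Commit types listed in this guide (assumed to be the complete set).
COMMIT_TYPES = ("feat", "fix", "docs", "test", "refactor", "release")

# "<type>: <subject>" with a non-empty subject.
SUBJECT_PATTERN = re.compile(rf"^({'|'.join(COMMIT_TYPES)}): \S.*$")


def is_valid_subject(message: str) -> bool:
    """Check the first line of a commit message against "<type>: <subject>"."""
    lines = message.splitlines()
    return bool(lines) and SUBJECT_PATTERN.match(lines[0]) is not None


print(is_valid_subject("feat: add support for streaming responses"))  # True
print(is_valid_subject("added streaming"))  # False
```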

Pre-commit hooks

Pre-commit hooks run automatically before each commit and check:

Ruff — linting and formatting
mypy — type checking
uv-lock — dependency lock file sync
codespell — spell checking

Run hooks manually:

pre-commit run --all-files

Warning: pre-commit run --all-files may take several minutes. Do not cancel mid-run as it can corrupt state.

Use the -n flag to bypass hooks for intermediate work-in-progress commits:

git commit -n -m "wip: intermediate work"

Testing

Test markers

Tests use a four-tier granularity system. Every test belongs to exactly one tier:

Tier	When to use	How to apply
`unit`	Self-contained, no services, no I/O	Auto-applied — never write `@pytest.mark.unit`
`integration`	Real SDK/library boundary or multi-component wiring	`@pytest.mark.integration`
`e2e`	Real backends (Ollama, APIs, GPU models), deterministic assertions	`@pytest.mark.e2e` + backend marker(s)
`qualitative`	Subset of e2e with non-deterministic output assertions	`@pytest.mark.qualitative` per-function, `e2e` + backend at module level

Backend markers (only for e2e/qualitative tests):

Marker	Backend	Resources
`ollama`	Ollama (port 11434)	Local, light (~2–4 GB RAM)
`openai`	OpenAI API or compatible	API calls (may use Ollama `/v1`)
`watsonx`	Watsonx API	API calls, requires credentials
`huggingface`	HuggingFace transformers	Local, GPU required
`litellm`	LiteLLM (wraps other backends)	Depends on underlying backend
`bedrock`	AWS Bedrock	API calls, requires credentials

Resource predicates (from test/predicates.py, for e2e/qualitative tests):

Predicate	Use when test needs
`require_gpu()`	Any GPU (CUDA or MPS)
`require_gpu(min_vram_gb=N)`	GPU with at least N GB VRAM
`require_ram(min_gb=N)`	N GB+ system RAM
`require_api_key("ENV_VAR")`	Specific API credentials
`require_package("pkg")`	Optional dependency
`require_python((3, 11))`	Minimum Python version

Other markers:

Marker	Purpose
`slow`	Tests taking >1 minute (excluded by default)
`qualitative`	Non-deterministic output (skipped when `CICD=1`)

For more information, see our Markers Guide.
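
One way to read the tier table is as a lookup from a test's markers to a single tier. The sketch below encodes that reading in plain Python. `resolve_tier` and `is_skipped_in_ci` are hypothetical helpers based only on the tables above; the project's actual pytest configuration may implement these rules differently.

```python
# Tier markers, most specific first: qualitative tests also carry the e2e marker.
TIER_MARKERS = ("qualitative", "e2e", "integration")


def resolve_tier(markers: set[str]) -> str:
    """Return the tier for a test given the markers applied to it."""
    for tier in TIER_MARKERS:
        if tier in markers:
            return tier
    # No explicit tier marker: the test counts as a unit test.
    return "unit"


def is_skipped_in_ci(markers: set[str], env: dict[str, str]) -> bool:
    """Qualitative tests are skipped when CICD=1 is set."""
    return env.get("CICD") == "1" and "qualitative" in markers


print(resolve_tier(set()))  # unit
print(resolve_tier({"e2e", "ollama"}))  # e2e
print(resolve_tier({"e2e", "ollama", "qualitative"}))  # qualitative
print(is_skipped_in_ci({"e2e", "ollama", "qualitative"}, {"CICD": "1"}))  # True
```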

Running tests

# Install all dependencies (required for tests)
uv sync --all-extras --all-groups

# Start Ollama in a separate terminal; it keeps running in the foreground (required for most tests)
ollama serve

# Default: runs qualitative tests, skips slow tests
uv run pytest

# Fast tests only (no qualitative, ~2 min)
uv run pytest -m "not qualitative"

# Run only slow tests (>1 min)
uv run pytest -m slow

# Run specific backend tests
uv run pytest -m "ollama"
uv run pytest -m "openai"

# Run unit tests only (no backends needed)
uv run pytest -m unit

# CI/CD mode (skips qualitative tests)
CICD=1 uv run pytest

Timing expectations

Run	Duration
Fast tests (`-m "not qualitative"`)	~2 minutes
Default (qualitative, no slow)	Several minutes
Slow tests (`-m slow`)	More than 1 minute
Pre-commit hooks	1–5 minutes

Replicate CI locally

# Run pre-commit checks (same as CI)
pre-commit run --all-files

# Run tests with CICD flag (same as CI, skips qualitative tests)
CICD=1 uv run pytest

Pull request process

Create an issue describing your change (if one does not already exist).
Fork the repository.
Create a branch in your fork using the naming convention above.
Make your changes following the coding standards.
Add tests for new functionality.
Run the test suite to confirm everything passes.
Update documentation as needed.
Push to your fork and open a pull request.
Follow the automated PR workflow instructions in the PR template.

Troubleshooting

Problem	Fix
`ComponentParseError`	LLM output did not match expected type. Add examples to the docstring.
`uv.lock` out of sync	Run `uv sync` to update the lock file.
`Ollama refused connection`	Run `ollama serve` to start the Ollama server.
`ConnectionRefusedError` (port 11434)	Ollama is not running. Start with `ollama serve`.
`TypeError: missing positional argument`	First argument to a `@generative` function must be session `m`.
Output is wrong or None	Model too small or prompt insufficient. Try a larger model or add a `reasoning` field.
`error: can't find Rust compiler`	Python 3.13+ requires Rust for outlines. Install Rust or use Python 3.12.
Tests fail on Intel Mac	Use conda: `conda install 'torchvision>=0.22.0'` then `uv pip install mellea`.
Pre-commit hooks fail	Run `pre-commit run --all-files` to see specific issues. Fix them, or use `git commit -n` to bypass.

Debugging tips

from mellea.core import FancyLogger

# Enable debug logging
FancyLogger.get_logger().setLevel("DEBUG")

# Inspect the exact prompt sent to the LLM (`m` is your active Mellea session)
print(m.last_prompt())
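
If the logger returned by `FancyLogger.get_logger()` follows the standard `logging.Logger` interface (an assumption here, not something this guide confirms), the level call behaves as in the stdlib-only sketch below. The logger name is a stand-in for the example, not Mellea's logger.

```python
import logging

# Stand-in logger for illustration; not the logger Mellea uses.
logger = logging.getLogger("mellea_debug_example")
logger.addHandler(logging.StreamHandler())

# Level names are accepted as strings as well as logging constants.
logger.setLevel("DEBUG")

print(logger.level == logging.DEBUG)  # True
print(logger.isEnabledFor(logging.DEBUG))  # True
logger.debug("debug messages are now emitted")
```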

Contributing to the docs

Documentation lives in docs/docs/. The writing guide at docs/docs/guide/CONTRIBUTING covers conventions, the PR checklist, and the review process for documentation contributions. Key points:

Start body content with H2 — Mintlify renders the frontmatter title as the page heading.
Omit .md extensions from internal links.
Tag every fenced code block with a language.
Run npx markdownlint-cli2 and fix all warnings before committing.

Getting help

Check existing issues
Join the GitHub Discussions
Open a new issue with the appropriate label

See also: Building Extensions
