Configuration
Learn how to use configuration files to manage LLM settings and guardrails.
Overview
Fortified LLM Client supports two configuration file formats:
- TOML (`.toml`) - Recommended for readability
- JSON (`.json`) - For programmatic generation
Configuration files can specify:
- LLM connection settings (API URL, model, authentication)
- Sampling parameters (temperature, max tokens, seed)
- Token validation settings
- Response formatting options
- Guardrails configuration (input/output validation)
Merge Behavior
Priority: CLI arguments > Config file
CLI arguments always override config file values. This allows you to:
- Define base settings in the config file
- Override specific values via CLI flags as needed
Example:
```shell
# config.toml has model="llama3"
fortified-llm-client -c config.toml --model gpt-4  # Uses gpt-4, not llama3
```
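The override rule amounts to a simple map merge. A minimal Python sketch of the documented behavior (an illustration only, not the client's actual implementation):

```python
# Minimal sketch of the documented merge rule: CLI values win whenever
# they are actually set; unset CLI flags fall through to the config file.
def merge(config_file: dict, cli_args: dict) -> dict:
    merged = dict(config_file)
    merged.update({k: v for k, v in cli_args.items() if v is not None})
    return merged

settings = merge({"model": "llama3", "temperature": 0.7},
                 {"model": "gpt-4", "temperature": None})
print(settings)  # {'model': 'gpt-4', 'temperature': 0.7}
```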
Partial Config Files
Config files do not need to include all fields. Any required field can be provided via CLI arguments instead. This enables patterns like:
- Infrastructure config: Only `api_url`, `model`, `api_key_name` in the file; prompts provided via CLI
- Guardrails-only config: Only the `[guardrails]` section; all other fields via CLI
- Base + override: Common settings in the file, per-run values via CLI
```shell
# Config file only has api_url, model, and guardrails
# System and user prompts provided via CLI
fortified-llm-client -c infra.toml --system-text "You are helpful" --user-text "Hello"
```
TOML Format
Basic Configuration
config.toml:
```toml
# Connection settings
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"

# Sampling parameters
temperature = 0.7
max_tokens = 2000
seed = 42

# Token management
validate_tokens = true
context_limit = 8192

# Timeout
timeout_secs = 300
```
With System Prompt
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"
system_prompt = "You are a helpful Rust expert. Explain concepts clearly."
temperature = 0.7
```
With API Authentication
```toml
api_url = "https://api.openai.com/v1/chat/completions"
model = "gpt-4"
api_key_name = "OPENAI_API_KEY" # Environment variable name
temperature = 0.7
max_tokens = 1000
```
Never commit API keys directly in config files! Use `api_key_name` to reference environment variables.
With Response Formatting
```toml
api_url = "https://api.openai.com/v1/chat/completions"
model = "gpt-4"
api_key_name = "OPENAI_API_KEY"

# JSON Schema validation
response_format = "json-schema"
response_format_schema = "schemas/product.json"
response_format_schema_strict = true
```
With Input Guardrails (Regex-Based)
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"
temperature = 0.7

[guardrails.input]
type = "regex"
max_length_bytes = 1048576 # 1MB
patterns_file = "patterns/input.txt" # Optional: custom patterns
severity_threshold = "medium" # Violations below this become warnings
```
Pattern file format (patterns/input.txt):
```text
CRITICAL | Social Security Number | \b\d{3}-\d{2}-\d{4}\b
HIGH | Credit Card Number | \b\d{4}[- ]?\d{4}[- ]?\d{4}[- ]?\d{4}\b
MEDIUM | Email Address | \b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b
```
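Custom patterns are worth sanity-checking before deployment. A hypothetical parser for the three-column format shown above (`parse_pattern_line` is illustrative; the client's real parser is not shown here):

```python
import re

# Hypothetical parser for "SEVERITY | name | regex" lines. maxsplit=2
# keeps any "|" characters inside the regex column intact.
def parse_pattern_line(line: str):
    severity, name, pattern = (part.strip() for part in line.split("|", 2))
    return severity, name, re.compile(pattern)

sev, name, rx = parse_pattern_line(
    r"CRITICAL | Social Security Number | \b\d{3}-\d{2}-\d{4}\b")
print(sev, bool(rx.search("SSN: 123-45-6789")))  # CRITICAL True
```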
With Output Guardrails
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"

[guardrails.output]
type = "regex"
max_length_bytes = 2097152 # 2MB
patterns_file = "patterns/output.txt"
severity_threshold = "high"
```
Pattern file format (patterns/output.txt):
```text
CRITICAL | Dangerous instructions | bomb|weapon|hack
HIGH | Toxic language | offensive|harmful
MEDIUM | Inappropriate content | controversial
```
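The `severity_threshold` rule can be sketched as an ordered comparison. A hedged Python illustration (the exact blocking semantics are an assumption based on the comment "violations below this become warnings" above):

```python
# Sketch of the severity_threshold rule: findings at or above the
# threshold block the request; lower-severity findings become warnings.
LEVELS = {"low": 0, "medium": 1, "high": 2, "critical": 3}

def classify(finding_severity: str, threshold: str) -> str:
    return "block" if LEVELS[finding_severity] >= LEVELS[threshold] else "warn"

print(classify("high", "medium"))  # block
print(classify("low", "medium"))   # warn
```

With `severity_threshold = "high"` as in the output config above, a MEDIUM pattern match would only warn.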
With LLM-Based Guardrails (Llama Guard)
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"

[guardrails.input]
type = "llama_guard"

[guardrails.input.llama_guard]
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama-guard-3"
max_tokens = 512
timeout_secs = 60

# Enable specific safety categories (S1-S13)
enabled_categories = ["S1", "S2", "S3"] # Violent Crimes, Non-Violent Crimes, Sex-Related Crimes
```
See Guardrails for all guardrail types and options.
Hybrid Guardrails (Defense-in-Depth)
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"

[guardrails.input]
type = "composite"
execution = "sequential" # or "parallel"
aggregation = "all_must_pass" # or "any_can_pass"

# Layer 1: Fast regex checks
[[guardrails.input.providers]]
type = "regex"
max_length_bytes = 1048576
patterns_file = "patterns/input.txt"
severity_threshold = "medium"

# Layer 2: LLM-based validation
[[guardrails.input.providers]]
type = "llama_guard"
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama-guard3:8b"
timeout_secs = 30
enabled_categories = ["S1", "S2", "S3", "S4", "S5"]
```
See Hybrid Guardrails for detailed configuration.
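The execution/aggregation semantics above can be sketched in a few lines. A Python illustration (providers are plain callables here, standing in for the regex and Llama Guard checks; this is not the client's code):

```python
# Sketch of composite aggregation: "all_must_pass" requires every layer
# to approve; "any_can_pass" requires at least one.
def run_composite(providers, text, aggregation="all_must_pass"):
    results = [check(text) for check in providers]  # sequential evaluation
    if aggregation == "all_must_pass":
        return all(results)
    return any(results)  # "any_can_pass"

fast_regex = lambda t: "forbidden" not in t   # layer 1: cheap check
llm_check = lambda t: len(t) < 100            # layer 2: stand-in for Llama Guard
print(run_composite([fast_regex, llm_check], "hello"))  # True
```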
JSON Format
Basic Configuration
config.json:
```json
{
  "api_url": "http://localhost:11434/v1/chat/completions",
  "model": "llama3",
  "temperature": 0.7,
  "max_tokens": 2000,
  "validate_tokens": true
}
```
With Guardrails
config.json:
```json
{
  "api_url": "http://localhost:11434/v1/chat/completions",
  "model": "llama3",
  "guardrails": {
    "input": {
      "type": "regex",
      "max_length_bytes": 1048576,
      "patterns_file": "patterns/input.txt",
      "severity_threshold": "medium"
    },
    "output": {
      "type": "regex",
      "patterns": {
        "detect_toxic": true
      }
    }
  }
}
```
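Since JSON is the format recommended for programmatic generation, a config like the one above can be assembled in code. A minimal Python sketch (field names mirror the example; the script is independent of the client itself):

```python
import json

# Build the guardrails config as plain data, then serialize it.
config = {
    "api_url": "http://localhost:11434/v1/chat/completions",
    "model": "llama3",
    "guardrails": {
        "input": {
            "type": "regex",
            "max_length_bytes": 1048576,
            "patterns_file": "patterns/input.txt",
            "severity_threshold": "medium",
        }
    },
}
print(json.dumps(config, indent=2))  # write this to config.json
```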
All Configuration Fields
Top-Level Fields
| Field | Type | Description | Default |
|---|---|---|---|
| `api_url` | String | LLM API endpoint URL | None (required via CLI or config) |
| `model` | String | Model name/identifier | None (required via CLI or config) |
| `provider` | String | Force provider: `"openai"` or `"ollama"` | Auto-detect |
| `system_prompt` | String | System prompt text | None (required via CLI or config) |
| `system_prompt_file` | String | Path to system prompt file | None |
| `user_prompt` | String | User prompt text | None |
| `user_prompt_file` | String | Path to user prompt file | None |
| `temperature` | Float | Sampling temperature (0.0-2.0) | 0.0 |
| `max_tokens` | Integer | Maximum response tokens | Provider default |
| `seed` | Integer | Random seed for reproducibility | None |
| `validate_tokens` | Boolean | Enable token validation | false |
| `context_limit` | Integer | Override context window limit | Auto-detect |
| `response_format` | String | `"text"`, `"json-object"`, or `"json-schema"` | `"text"` |
| `response_format_schema` | String | Path to JSON Schema file | None |
| `response_format_schema_strict` | Boolean | Strict schema validation | true |
| `api_key` | String | API key (direct value) | None |
| `api_key_name` | String | Environment variable for API key | None |
| `timeout_secs` | Integer | Request timeout in seconds | 300 |
All fields in the config file are optional. Required fields (api_url, model, system prompt, and user prompt) can be provided via CLI arguments instead. CLI arguments always take priority over config file values.
Guardrails Section
See Guardrails Configuration for complete details.
Unified Guardrails Configuration
You can apply the same guardrail configuration to both input and output using the flattened format:
```toml
# This applies to BOTH input and output
[guardrails]
type = "regex"
max_length_bytes = 1048576
patterns_file = "patterns/common.txt"
severity_threshold = "medium"
```
This is equivalent to:
```toml
[guardrails.input]
type = "regex"
max_length_bytes = 1048576
patterns_file = "patterns/common.txt"
severity_threshold = "medium"

[guardrails.output]
type = "regex"
max_length_bytes = 1048576
patterns_file = "patterns/common.txt"
severity_threshold = "medium"
```
Override Pattern: Explicit input or output fields take precedence over the flattened format:
```toml
# Base config for both
[guardrails]
type = "regex"
max_length_bytes = 1048576

# Override only for output
[guardrails.output]
max_length_bytes = 2097152 # 2MB for longer responses
```
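The expansion of the flattened format into explicit input/output sections can be pictured as a dictionary merge. A hypothetical Python illustration (`expand` is not part of the client; explicit keys win over the shared base):

```python
# Expand a flattened [guardrails] table into explicit input/output
# sections; explicit per-direction keys override the shared base.
def expand(guardrails: dict) -> dict:
    base = {k: v for k, v in guardrails.items() if k not in ("input", "output")}
    return {
        "input": {**base, **guardrails.get("input", {})},
        "output": {**base, **guardrails.get("output", {})},
    }

cfg = expand({"type": "regex", "max_length_bytes": 1048576,
              "output": {"max_length_bytes": 2097152}})
print(cfg["output"])  # {'type': 'regex', 'max_length_bytes': 2097152}
```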
CLI-Only Fields
These fields cannot be set in config files and must be provided via CLI:
- `config_file` - Path to the config file itself
- `verbose` - Enable verbose logging
- `quiet` - Suppress all logging
- `output` - Output file path
- `enable_input_validation` - Simple CLI-based input validation
- `max_input_length` - Max input bytes (CLI validation)
- `max_input_tokens` - Max input tokens (CLI validation)
For guardrails, use the `[guardrails]` section in config files instead of CLI flags.
Usage Examples
Example 1: CLI with Config
```shell
# Use config as base
fortified-llm-client -c config.toml --user-text "Hello"

# Override specific values
fortified-llm-client -c config.toml --temperature 1.0 --user-text "Be creative"
```
Example 2: Library with Guardrails
```rust
use fortified_llm_client::{
    evaluate, ConfigBuilder,
    guardrails::{GuardrailProviderConfig, RegexGuardrailConfig, Severity},
};
use std::path::PathBuf;

#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error>> {
    let config = ConfigBuilder::new()
        .api_url("http://localhost:11434/v1/chat/completions")
        .model("llama3")
        .user_prompt("Your prompt")
        .input_guardrails(GuardrailProviderConfig::Regex(RegexGuardrailConfig {
            max_length_bytes: 1048576,
            patterns_file: Some(PathBuf::from("patterns/input.txt")),
            severity_threshold: Severity::Medium,
        }))
        .build()?;

    let result = evaluate(config).await?;
    println!("{}", result.content);
    Ok(())
}
```
Example 3: Multiple Configs
Create environment-specific configs:
dev.toml:
```toml
api_url = "http://localhost:11434/v1/chat/completions"
model = "llama3"
temperature = 1.0 # More creative for testing
```
prod.toml:
```toml
api_url = "https://api.openai.com/v1/chat/completions"
model = "gpt-4"
api_key_name = "OPENAI_API_KEY"
temperature = 0.7
validate_tokens = true

[guardrails.input]
type = "hybrid"
# ... strict validation for production
```
Usage:
```shell
# Development
fortified-llm-client -c dev.toml --user-text "test"

# Production
fortified-llm-client -c prod.toml --user-text "production query"
```
Validation
Config files are validated after merging with CLI arguments. Common errors:
Missing Required Fields
Error: API URL must be provided via --api-url or in config file (--config-file)
Fix: Provide the field via CLI argument or add it to the config file. Required fields: api_url, model, system prompt (via --system-text/--system-file or system_prompt/system_prompt_file), and user prompt (via --user-text/--user-file/--pdf-file or user_prompt/user_prompt_file/pdf_file).
Invalid File Extension
Error: Config file must have .json or .toml extension
Fix: Rename file to .toml or .json
TOML Syntax Errors
Error: Failed to merge config: expected an equals, found ...
Fix: Check TOML syntax (use TOML validator)
JSON Syntax Errors
Error: Failed to merge config: expected value at line X column Y
Fix: Check JSON syntax (use JSON validator)
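For syntax errors, it can help to pre-validate a config before handing it to the client. A small Python sketch using only the standard library (independent of the client; the line/column attributes match the positions quoted in the error message):

```python
import json

# Trailing commas are invalid JSON; JSONDecodeError reports the location.
bad = '{"model": "llama3",}'
try:
    json.loads(bad)
    err = None
except json.JSONDecodeError as e:
    err = e
    print(f"invalid JSON at line {e.lineno} column {e.colno}")
```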
Best Practices
1. Use Environment Variables for Secrets
Don’t:
```toml
api_key = "sk-real-key-here" # NEVER commit this!
```
Do:
```toml
api_key_name = "OPENAI_API_KEY" # Reference env var
```
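The indirection works because the config stores only the variable *name*; the secret itself stays in the environment. A Python sketch of the lookup (illustrative, not the client's code):

```python
import os

# Normally set in your shell, never in the config file.
os.environ["OPENAI_API_KEY"] = "sk-example"

api_key_name = "OPENAI_API_KEY"   # value read from config.toml
api_key = os.environ[api_key_name]
print(api_key)  # sk-example
```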
2. Environment-Specific Configs
Keep separate configs for different environments:
- `dev.toml` - Local development
- `staging.toml` - Staging environment
- `prod.toml` - Production
3. Override for Testing
Use config for stable settings, CLI for experiments:
```shell
# Base: config.toml (temperature=0.7)
fortified-llm-client -c config.toml --temperature 0.0 # Test deterministic
fortified-llm-client -c config.toml --temperature 1.5 # Test creative
```
4. Document Custom Configs
Add comments to explain non-obvious settings:
```toml
# GPT-4 Turbo with 128K context window
model = "gpt-4-turbo-preview"
context_limit = 128000

# Lower temperature for factual extraction tasks
temperature = 0.3
```
Next Steps
- Guardrails - Configure security validation
- Examples - Complete config examples
- CLI Usage - Override config with CLI flags