AI Model Control Dashboard

Gemini 2.5 Flash

Cost-effective performance model with thinking budget control

Primary

Input Cost

$0.15/M tokens

Output Cost

Thinking Disabled: $0.60/M tokens

Thinking Enabled: $3.50/M tokens

Thinking Budget

Balanced

Model Pipeline Configuration

Web Search Models (2)

SearXNG

Open-source search

Google/DuckDuckGo

Hybrid search

Analysis Models (2)

Mistral-7B

General analysis

LLaMA-2-13B

High precision

Thinking Models (2)

Falcon-180B-Chat

Deep philosophical reasoning

WizardMath-70B

Logical reasoning

Output Model

GPT4All

Local lightweight
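The pipeline listed above could be captured as plain data, which makes the per-stage model counts easy to check. This is only a sketch: the `Stage` class, the `PIPELINE` constant, and the `model_counts` helper are hypothetical names introduced for illustration, and the dashboard does not say how it actually stores this configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One pipeline stage and its (model, role) pairs, as shown on the dashboard."""
    name: str
    models: tuple[tuple[str, str], ...]


# Hypothetical representation; names and roles are copied from the dashboard.
PIPELINE = (
    Stage("Web Search", (("SearXNG", "Open-source search"),
                         ("Google/DuckDuckGo", "Hybrid search"))),
    Stage("Analysis", (("Mistral-7B", "General analysis"),
                       ("LLaMA-2-13B", "High precision"))),
    Stage("Thinking", (("Falcon-180B-Chat", "Deep philosophical"),
                       ("WizardMath-70B", "Logical reasoning"))),
    Stage("Output", (("GPT4All", "Local lightweight"),)),
)


def model_counts(pipeline):
    """Return the number of models configured for each stage."""
    return {stage.name: len(stage.models) for stage in pipeline}


if __name__ == "__main__":
    for stage_name, count in model_counts(PIPELINE).items():
        print(f"{stage_name}: {count}")
```

Keeping the stages in an ordered tuple preserves the search, analysis, thinking, output sequence shown on the dashboard.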

Cost Estimation

Monthly Token Usage

2 million tokens

Estimated Monthly Cost: $1.50

With thinking disabled for 2M tokens
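The $1.50 figure can be reproduced from the listed rates under one assumption the dashboard does not state: that the 2M monthly volume applies to both input and output tokens. The sketch below uses hypothetical names (`estimate_monthly_cost` and the rate constants) and is not the dashboard's own code.

```python
# Rates in USD per million tokens, as listed on the dashboard.
INPUT_RATE = 0.15
OUTPUT_RATE_THINKING_OFF = 0.60
OUTPUT_RATE_THINKING_ON = 3.50


def estimate_monthly_cost(input_millions, output_millions, thinking=False):
    """Estimate monthly cost in USD for the given token volumes (in millions).

    Assumption: input and output tokens are billed separately at the
    listed rates, and the thinking toggle affects only the output rate.
    """
    output_rate = OUTPUT_RATE_THINKING_ON if thinking else OUTPUT_RATE_THINKING_OFF
    cost = input_millions * INPUT_RATE + output_millions * output_rate
    return round(cost, 2)


if __name__ == "__main__":
    # 2 * 0.15 + 2 * 0.60 = 1.50, matching the dashboard estimate.
    print(estimate_monthly_cost(2, 2))
    # Same volume with thinking enabled: 2 * 0.15 + 2 * 3.50 = 7.30.
    print(estimate_monthly_cost(2, 2, thinking=True))
```

Under the same assumption, enabling thinking raises the estimate from $1.50 to $7.30, since only the output rate changes.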

Context Memory System

Active Memory: 65% used

Real-time conversation tracking

Long-term Storage: 30% used

All conversation archives

Memory Assistants

Real-time Recall

Archive Manager
