
Comparia

Open source LLM arena created by the French Government

Open Source LLM Arena

Collect human preference datasets for less-resourced languages and specific sectors,
while raising awareness about model diversity, bias, and environmental impact.


Built by the French government, now growing into new languages and sectors.

🇫🇷 French platform  ·  🇩🇰 Danish platform

Supported by DINUM, Ministry of Culture, ALT-EDIC, Denmark, and recognised as a Digital Public Good


How does it work?

flowchart LR
    U["👤 Ask"] --> A["🤖 Compare"] --> V["🗳️ Vote"] --> R["🔍 Reveal"]

    R --> L["🏆 Leaderboard"]
    R --> T["🧠 Rare data for model training"]
    R --> M["🗺️ Use case mapping"]
    R --> E1["💡 Model diversity"]
    R --> E2["⚖️ Bias awareness"]
    R --> E3["🌱 Env. impact"]

    style U fill:#f0f4ff,stroke:#3558a2
    style A fill:#f0f4ff,stroke:#3558a2
    style V fill:#f0f4ff,stroke:#3558a2
    style R fill:#f0f4ff,stroke:#3558a2
    style E1 fill:#e8f5e9,stroke:#388e3c
    style E2 fill:#e8f5e9,stroke:#388e3c
    style E3 fill:#e8f5e9,stroke:#388e3c
    style L fill:#fff3e0,stroke:#e65100
    style T fill:#fff3e0,stroke:#e65100
    style M fill:#fff3e0,stroke:#e65100

🟦 User journey    🟩 Awareness value    🟧 Dataset value
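
To make the step from Vote to Leaderboard concrete, here is a minimal Python sketch of one common way arenas turn pairwise votes into a ranking: fitting a Bradley-Terry model. It is an illustration only. This README does not state how Compar:IA computes its leaderboard, and the function name `bradley_terry`, the `(winner, loser)` vote format, and the model names are all hypothetical. Ties are left out for simplicity.

```python
from collections import defaultdict


def bradley_terry(votes, iterations=200):
    """Estimate a strength per model from (winner, loser) pairs.

    Hypothetical helper, not Compar:IA's actual code. Uses the classic
    minorization-maximization update for the Bradley-Terry model and
    rescales strengths so that they average 1.0.
    """
    wins = defaultdict(int)                        # total wins per model
    games = defaultdict(lambda: defaultdict(int))  # games[a][b] = number of a-vs-b votes
    for winner, loser in votes:
        wins[winner] += 1
        games[winner][loser] += 1
        games[loser][winner] += 1

    models = sorted(games)
    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            # Only opponents that m actually faced contribute to the update.
            denom = sum(n / (strength[m] + strength[o]) for o, n in games[m].items())
            updated[m] = wins[m] / denom
        total = sum(updated.values())
        strength = {m: s * len(models) / total for m, s in updated.items()}
    return strength


# Hypothetical votes: each pair of models is compared four times.
votes = (
    [("model-a", "model-b")] * 3 + [("model-b", "model-a")]
    + [("model-b", "model-c")] * 3 + [("model-c", "model-b")]
    + [("model-a", "model-c")] * 3 + [("model-c", "model-a")]
)
strength = bradley_terry(votes)
ranking = sorted(strength, key=strength.get, reverse=True)
print(ranking)
```

Unlike a running Elo score, this fit does not depend on the order in which votes arrive, which is convenient when a dataset is re-processed in bulk.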


🇫🇷 The French use case

Launched in October 2024 by DINUM and the French Ministry of Culture to address the lack of French-language preference data for LLM training and evaluation.

Since launch: 600,000+ prompts, 250,000+ preference votes, and 300,000+ visitors, making it one of the largest non-English human preference datasets available. All data is published openly on Hugging Face.

We published a pre-print to dive deep into the project's strategy in France.


Compar:IA on the France 2 evening news, used in the classroom to teach students about AI models, bias, and environmental impact.


For whom?

🌍 Languages

Most LLMs underperform outside English. Compar:IA collects the preference data needed to close this gap.

Already live in French and Danish, with launches planned in Sweden, Estonia and Lithuania.

๐Ÿ›๏ธ Sectors

Generic benchmarks miss domain-specific needs. A sector arena reveals which models handle specialised language best.

Healthcare, legal, education, public admin, agriculture...

๐Ÿข Organisations

Run your own arena, evaluate models on your real-world tasks, and contribute data back to the commons.

Governments, universities, hospitals, companies, NGOs...


Benefits

💡 Raise awareness

Teach citizens and professionals about model diversity, bias, and environmental cost. Already used in schools and training sessions.

Blind comparison between two models

📊 Generate rare datasets

Produce instruction and preference data in less-resourced languages.

Dataset analysis visualization

🔁 Downstream reuse

Data feeds into new model training, leaderboards, use case mappings, and other research topics.

Downstream data analysis

Interested in an arena for your language, sector, or organisation?

The platform is fully open source, self-hostable, and customizable: choose your models, translate the interface, adapt prompt suggestions, add your logo. We can host it for you or help you set it up yourself.

Whatever your situation, reach out first and we'll figure out the best path together.

📬 [email protected]


Contribute, we need you 🤝

Compar:IA is a digital common. Whether you can offer funding, code, translations, or simply ideas, there is a place for you.

💰 Financially. Compar:IA has been funded by DINUM and the French Ministry of Culture, with European support from ALT-EDIC. We are actively looking for new partners and funders to sustain the infrastructure, expand to new languages, and keep the project independent. [email protected]

💻 In code. The entire platform is open source and we welcome contributions of all sizes: bug fixes, new features, translations, documentation. Come build with us. GitHub repository

💬 In discussions. Share your ideas, flag issues, or just ask questions on GitHub Discussions. We want to hear from you. GitHub Discussions

Any other way. Partnerships, academic collaborations, media coverage, spreading the word: every contribution matters. Reach out and let's talk. Contact us


Roadmap

🟢 In Progress

  • EcoLogits update #253 (🇪🇺 ALT-EDIC, 🇫🇷 DINUM)
  • Gradio → FastAPI migration (🇫🇷 Ministry of Culture, 🇫🇷 DINUM, 🇪🇺 ALT-EDIC)
  • Language/platform-specific model support (🇪🇺 ALT-EDIC, 🇫🇷 DINUM)
  • Dataset publication pipeline configurable per language/platform, with customizable publication delays and anonymization pipelines (🇪🇺 ALT-EDIC, 🇫🇷 DINUM)

🔮 Up Next

  • Web search and document upload
  • Authentication
  • Style control #273
  • Ranking consolidation and internationalization
  • Message history
  • Easier deployment and streamlined onboarding
  • Improved anonymization pipeline
  • Live use-case mapping

⛵ Shipped

  • Dataset publishing pipeline v1 (🇫🇷 DINUM, 🇫🇷 Ministry of Culture)
  • Leaderboard v1 (🇫🇷 DINUM, 🇫🇷 Ministry of Culture, in collaboration with 🇫🇷 PEReN)
  • Archived models (🇫🇷 DINUM, 🇫🇷 Ministry of Culture)
  • Blog section (🇫🇷 DINUM, 🇫🇷 Ministry of Culture)
  • Internationalization foundations (🇫🇷 DINUM, 🇫🇷 Ministry of Culture)
  • Compar:IA v1 (🇫🇷 DINUM, 🇫🇷 Ministry of Culture)

👉 Full technical roadmap on GitHub


Getting started

The platform is fully open source and self-hostable. The quickest way to get running:

cp .env.example .env       # Configure environment
make install               # Install all dependencies
make dev                   # Start backend + frontend

For the full setup guide (Docker, manual setup, testing, database, models, i18n, architecture), see CONTRIBUTING.md.

Digital Public Goods Badge
