TL;DR

OpenEuroLLM is a major European project pooling resources to build open-source multilingual LLMs. Despite progress, compute capacity remains a key bottleneck, impacting the project’s timeline and outcomes.

European AI researchers and institutions are developing a large-scale, open-source multilingual language model through the OpenEuroLLM project, but they face significant challenges in securing enough computational resources to complete the models on schedule.

OpenEuroLLM is a pan-European initiative with a total budget of €37.4 million, of which €20.6 million comes from the EU’s Digital Europe Programme. It involves 20 organizations spanning universities, companies, and high-performance computing centers. Led by Jan Hajič at Charles University and co-led by Peter Sarlin of Silo AI (owned by AMD), the project aims to create a multilingual open-source LLM for public use, targeting 35 languages.

As of the March 2026 progress report, the project has achieved its first-year goals, but the coordinator, Hajič, highlighted that ‘significant challenges, especially in securing more compute for creating the final models, still remain.’ The first models are scheduled for delivery by July 31, 2026 and the final models by January 31, 2028; resource constraints pose risks to this timeline, particularly for the final models.

The consortium’s structure reflects a strategic response to the resource limitations faced by national projects like Italy’s Minerva and Portugal’s AMÁLIA, which are also exploring different approaches to sovereign AI development. However, Hajič’s comments reveal that even at this pooled scale, computational capacity is a limiting factor, echoing similar constraints seen in other European efforts.

OpenEuroLLM · The Third Path.
DISPATCH / MAY 2026 ESSAY · EUROPEAN SOVEREIGN LLMs · OPENEUROLLM · CONSORTIUM
Standalone Essay 03 · European Sovereign AI · The Consortium Case Study


€37.4M EU budget, 20 organizations, four major EuroHPC supercomputers, 35 target languages. And the project’s coordinator says: “significant challenges, especially in securing more compute for creating the final models, still remain.”

Italy bet national. Portugal bet continuation. The EU bet consortium. OpenEuroLLM — coordinated by Jan Hajič at Charles University Prague, co-led by Peter Sarlin at AMD-owned Silo AI — is what the pan-European pooled-resources answer looks like in operational form. And the project lead is publicly stating that even at pan-European pooled scale, compute is the bottleneck. Each of the three sovereign-LLM answers, examined honestly, surfaces a complication the press coverage downplays.

▲ The structural editorial finding
The European sovereign-LLM movement’s three answers — Minerva from-scratch, AMÁLIA continuation, OpenEuroLLM consortium — are now operating at sufficient scale and duration that their structural limits are visible. None of them is the answer. Each of them is an answer. The strategic discourse benefits from treating all three as complementary data points in the same empirical experiment about what European sovereign-AI development actually requires.
— standalone essay 03 · the OpenEuroLLM case study · may 2026
€37.4M · EU consortium budget · €20.6M from Digital Europe Programme · grant 101195233
“a pittance compared with the $100B US Stargate first tranche” (Fortune) · STEP Seal awarded

20 · Partner organizations · 12 universities · 6 companies · 3 HPC centers
Charles University coordinator · AMD Silo AI co-lead · Mistral notably absent

4.5M+ · GPU hours secured · Leonardo BOOSTER (3M) + LUMI (1.5M) + strategic allocations across 4 EuroHPC systems
“significant challenges in securing more compute still remain” (Hajič, March 2026)

Jul 2026 · First models deliverable · the strategic moment · due July 31, 2026
2 of 11 deliverables shipped · final models January 2028

OPENEUROLLM: €37.4M EU BUDGET · 20 ORGANIZATIONS · CHARLES UNIVERSITY + AMD SILO AI LEADS · STARTED FEB 1 2025
HAJIČ, MARCH 2026: “SIGNIFICANT CHALLENGES IN SECURING MORE COMPUTE FOR FINAL MODELS STILL REMAIN” · STRUCTURAL FINDING
COMPUTE: 3M GPU HOURS LEONARDO BOOSTER + 1.5M LUMI + STRATEGIC 4 EUROHPC SYSTEMS · $7B EUROHPC CONTEXT
THREE-WAY: MINERVA FROM-SCRATCH · AMÁLIA CONTINUATION · OPENEUROLLM CONSORTIUM · ALL THREE OPERATIONAL SUMMER 2026
YEAR ONE OUTPUTS: MIXTUREVITAE · HPLT 38 REFERENCE MODELS · OPEN-SCI-REF 0.01 · TRAINING DATA CATALOGUE · MULTISYNT
VS MINERVA: ITALY 128 GPUS LEONARDO · €100M+ PNRR · OPENEUROLLM 4.5M GPU HOURS · €37.4M EU BUDGET · ORDER OF MAGNITUDE LARGER POOLED
JULY 31 2026: FIRST MODELS · INITIAL DATASET · EVALUATION CODE · STRATEGIC MOMENT FOR EU SOVEREIGN-LLM MOVEMENT

The structural editorial anchor · Hajič’s compute statement

Even at pan-European scale, compute is the bottleneck.

From the OpenEuroLLM first-year progress report, March 6, 2026. The single most important sentence in the public documentation of the project. The pan-European consortium answer — explicitly designed as the response to individual national projects’ resource constraints — is itself constrained by the same resource that limits national projects.

Jan Hajič · OpenEuroLLM coordinator · first-year progress report
Charles University · Institute of Formal and Applied Linguistics (ÚFAL) · OpenEuroLLM coordinator · also coordinator of the HPLT (High Performance Language Technologies) project since 2022. The most quoted public statement about OpenEuroLLM’s structural constraints.
▲ On-record · OpenEuroLLM blog · March 6, 2026
Creating an open source multilingual LLM in the public space and within a large consortium is a challenging task. I am proud that thanks to the expertise, enthusiasm, commitment and hard work of especially the core partners the project has achieved its first-year goals. However, significant challenges, especially in securing more compute for creating the final models, still remain.
— Jan Hajič · Charles University · OpenEuroLLM coordinator
First-year progress and next steps · March 6, 2026
The structural significance: OpenEuroLLM has secured 3M GPU hours on Leonardo BOOSTER, 1.5M GPU hours on LUMI, and strategic compute allocations on four EuroHPC supercomputers through project end. This is real frontier-class scale. Hajič’s statement that it is insufficient for the final models means the pan-European consortium answer, as currently funded, may not produce final models at the parameter scale required to compete with US frontier developers on general capability. Position 1 (frontier-match) may need to be recalibrated to Position 2 + Position 3 (sovereignty, openness, and compliance combined with multilingual specialization).
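To put the allocation figures in perspective, here is a minimal Python sketch that totals the two named allocations and converts the sum into wall-clock time at a fixed GPU count. The 128-GPU figure is borrowed from the Minerva comparison in this essay purely as a yardstick; the 1,024-GPU figure, the function name, and the assumption of perfect utilization are hypothetical, and the unquantified strategic allocations are left out.

```python
# Named allocations from the first-year progress report, in GPU hours.
ALLOCATIONS = {"Leonardo BOOSTER": 3_000_000, "LUMI": 1_500_000}


def wall_clock_days(gpu_hours: float, gpus: int) -> float:
    """Days needed to spend gpu_hours on `gpus` GPUs running in parallel.

    Assumes perfect utilization and no queueing (an idealization).
    """
    if gpus <= 0:
        raise ValueError("gpus must be positive")
    return gpu_hours / gpus / 24


total = sum(ALLOCATIONS.values())
print(f"Total named allocation: {total:,} GPU hours")

# 128 GPUs is the Minerva figure cited in this essay, used only as a yardstick.
print(f"At 128 GPUs:   {wall_clock_days(total, 128):,.0f} days")

# Hypothetical larger job size, chosen only for illustration.
print(f"At 1,024 GPUs: {wall_clock_days(total, 1024):,.0f} days")
```

The sketch only restates the secured budget in different units. It says nothing about how many GPU hours the final models would need, a figure the progress report does not give.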
The consortium architecture · what 20 organizations actually looks like

12 universities. 6 companies. 3 HPC centers. One conspicuous absence.

The OpenEuroLLM consortium combines academic NLP research, commercial AI capability, and EuroHPC supercomputing infrastructure across multiple European nations. The breadth is the strategic bet. The breadth is also the operational complication.

OpenEuroLLM consortium · 20 organizations · three categories
From the official partner list. Project coordinator Jan Hajič at Charles University Prague. Co-lead Peter Sarlin at AMD-owned Silo AI Finland. Started February 1, 2025 with EU Digital Europe Programme funding under grant agreement 101195233.
▲ COORDINATOR
Jan Hajič
Charles University Prague · Institute of Formal and Applied Linguistics (ÚFAL) · Czech computational linguist · HPLT predecessor project coordinator since 2022
▲ CO-LEAD
Peter Sarlin
AMD Silo AI · CEO and co-founder · Finnish AI lab · acquired by AMD for $665M in 2024 · brings hyperscaler-adjacent compute access and commercial discipline
▲ Universities and Research Organizations (12)
Charles University Prague (coordinator) · AI Sweden · ALT-EDIC (France) · University of Tübingen · ELLIS Institute Tübingen · Fraunhofer IAIS (Germany) · Barcelona Supercomputing Center / BSC · Forschungszentrum Jülich · Eindhoven University · University of Helsinki · University of Oslo · University of Turku
▲ Companies (6)
Aleph Alpha (Germany) · AMD Silo AI (Finland, co-lead) · Ellamind (Germany) · LightOn (France) · ELDA (Evaluations and Language resources Distribution Agency, France) · Prompsit Language Engineering (Spain)
▲ HPC Centres (3)
CINECA (Italy), operating Leonardo, the supercomputer that trained Minerva · CSC (Finland), operating LUMI, one of Europe’s top supercomputers · SURF (Netherlands)
The conspicuous absence: Mistral, the French AI unicorn, is not in the consortium. From TechCrunch’s launch coverage, Hajič stated: “I tried to approach them, but it hasn’t resulted in a focused discussion about their participation.” Mistral has positioned itself as Europe’s commercial open-source alternative to US frontier developers — and its absence from the official EU sovereign-LLM consortium reflects a strategic-positioning divergence between consortium-led and commercial-led European AI development. The next standalone essay in this track examines that divergence directly.
The deliverables roadmap · 2 of 11 shipped · July 2026 is the strategic moment

Eleven deliverables. Two shipped. Nine pending.

From the official deliverables roadmap. As of mid-May 2026, only two of eleven deliverables have shipped — both from July 2025. The July 31, 2026 cluster — first models, initial dataset, evaluation code — is when OpenEuroLLM becomes empirically comparable to Minerva and AMÁLIA.

Deliverables timeline · 11-item roadmap through January 2028
From openeurollm.eu/deliverables. Status as of mid-May 2026. Each deliverable has a defined due date and a defined scope. The July 31, 2026 cluster is the strategic moment that makes OpenEuroLLM operationally comparable to Minerva (since November 2024) and AMÁLIA (June 2026 final target).
31 Jul 2025 · D3.1 · Initial training data catalogue and analytics reports · SHIPPED
31 Jul 2025 · D6.1 · Communication, Dissemination and Exploitation Strategy · SHIPPED
31 Jul 2026 · Initial dataset release · texts with metadata used to train OpenEuroLLM at mid-project · UPCOMING
31 Jul 2026 · First models · initial release of LLM models · tokenizers + model weights · UPCOMING
31 Jul 2026 · Evaluation Code package · Python package for model evaluation procedures · UPCOMING
31 Jul 2027 · Final dataset release · texts with metadata for final OpenEuroLLM model(s) · PENDING
31 Jan 2028 · Stakeholder Report · strategic advice from OSPB and community feedback · FINAL
31 Jan 2028 · Final models · final release of LLM models · tokenizers + model weights · FINAL
31 Jan 2028 · LLM training report · open publishing and regulatory compliance details · FINAL
31 Jan 2028 · Evaluation Report · multilingual and regulatory aspects findings · FINAL
31 Jan 2028 · Evaluation Report of Communication, Dissemination and Exploitation Strategy · FINAL
AMÁLIA’s final release is targeted for June 2026 and OpenEuroLLM’s first models for July 31, 2026, approximately six weeks apart. Once both have shipped, all three answers will have operational artifacts for the first time. This is the moment the structural comparison becomes empirically tractable.
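The roadmap can be tallied mechanically. The following is a small Python sketch that transcribes the eleven deliverables and counts what has shipped and what is pending per due date. The tuple layout, the shortened titles, and the `shipped` flags (taken from the mid-May 2026 status above) are illustrative assumptions, not an official data format.

```python
from collections import Counter
from datetime import date

# (due date, short title, shipped?) transcribed from the roadmap above.
DELIVERABLES = [
    (date(2025, 7, 31), "D3.1 Initial training data catalogue", True),
    (date(2025, 7, 31), "D6.1 Communication, Dissemination and Exploitation Strategy", True),
    (date(2026, 7, 31), "Initial dataset release", False),
    (date(2026, 7, 31), "First models", False),
    (date(2026, 7, 31), "Evaluation Code package", False),
    (date(2027, 7, 31), "Final dataset release", False),
    (date(2028, 1, 31), "Stakeholder Report", False),
    (date(2028, 1, 31), "Final models", False),
    (date(2028, 1, 31), "LLM training report", False),
    (date(2028, 1, 31), "Evaluation Report", False),
    (date(2028, 1, 31), "Evaluation Report of the CDE Strategy", False),
]

# Count shipped items, then group the remaining ones by due date.
shipped = sum(1 for _, _, done in DELIVERABLES if done)
pending_by_date = Counter(due for due, _, done in DELIVERABLES if not done)

print(f"{shipped} of {len(DELIVERABLES)} shipped")
for due, count in sorted(pending_by_date.items()):
    print(f"{due.isoformat()}: {count} pending")
```

Grouping by due date makes the two clusters visible: three items land together on July 31, 2026, and five on January 31, 2028.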
The three-way comparison · the essay track closes

Three answers. Three structural findings.

The Minerva from-scratch path. The AMÁLIA continuation path. The OpenEuroLLM consortium path. Each project surfaces an empirical complication the press coverage downplays. Each finding is harder than the framing it’s wrapped in.

Three operational answers · three structural findings
Italy’s national from-scratch investment. Portugal’s national continuation pre-training. The pan-European consortium pooled-resources approach. The strategic discourse benefits from treating all three as complementary experiments rather than competing national-prestige projects.
▲ ITALY · ESSAY 02
Minerva · national from-scratch
Funding: PNRR via MUR · large national
Architecture: From scratch · Mistral arch · custom IT tokenizer
Native data: 1.14T Italian (50%) of 2.5T total
Compute: 128 GPUs Leonardo · weeks
Openness: Truly-open · day one
Finding: Minerva-3B: 4.9% on INVALSI Italian school exam · data volume + params crucial above composition alone

▲ PORTUGAL · ESSAY 01
AMÁLIA · national continuation
Funding: €5.5M Portuguese gov
Architecture: Continuation · EuroLLM-derived · inherited tokenizer
Native data: 5.8B pt-PT (5.5%) of 107B mid-training
Compute: Not publicly detailed
Openness: Partially open · in progress
Finding: “Fully open” claim runs ahead of release · 5.5% pt-PT in model that prioritizes pt-PT

▲ PAN-EU · ESSAY 03
OpenEuroLLM · consortium
Funding: €37.4M EU · €20.6M Digital Europe
Architecture: From scratch · methodology developing
Native data: TBD · MultiSynt synthetic primary
Compute: 4.5M+ GPU hours · 4 EuroHPC
Openness: Truly-open commitment · some EU-copyright caveats
Finding: Hajič: “significant challenges in securing more compute still remain” · pan-EU pooled still constrained

Three projects. Three findings. Each one harder than the framing it’s wrapped in. Each answer is valid for its specific positioning and resource context. None of the three is “the right answer” in the abstract. The strategic discourse benefits from treating all three as data points in the same empirical experiment.

What July 2026 will determine · three scenarios

First models due July 31. Three scenarios.

The July 31, 2026 first-models deliverable is the strategic moment for OpenEuroLLM specifically and for the European sovereign-LLM movement broadly. Three scenarios are plausible. The structurally honest framing will require acknowledging whatever the empirical results actually show.

Three scenarios for the July 2026 OpenEuroLLM first models
In all three scenarios, the moment calls for the kind of discourse that O. Carmo’s analysis of AMÁLIA modeled and that this essay track has attempted to extend: holding competing views simultaneously. The work is real, and the empirical findings are harder than the press coverage suggests. Both can be true at once.
A · frontier-match
First models are capability-competitive at their parameter scale
If OpenEuroLLM’s 8B model demonstrates competitive performance against frontier developers’ similar-scale models on multilingual benchmarks, the pan-European consortium answer is validated. Position 1 + 2 + 3 combination. The strongest outcome for the European sovereign-LLM movement broadly — demonstrates pan-European pooling produces results individual national projects cannot.
B · recalibration
First models are methodologically interesting but capability-limited
If the 8B model demonstrates strong multilingual capability but lags frontier developers on general benchmarks, the project converges toward Position 2 + Position 3 — sovereignty/openness/compliance combined with multilingual specialization. The most likely outcome given Hajič’s compute statement and the structural funding asymmetry. Strategic ambition recalibration becomes explicit.
C · complication
First models surface a finding that complicates the simple narrative
Each of the prior two European sovereign-LLM projects surfaced a structural finding the press coverage downplayed (Minerva’s INVALSI 4.9%, AMÁLIA’s 5.5% pt-PT share). OpenEuroLLM’s first models will likely surface their own version. Very uneven performance across the 35-language portfolio is one likely complication. Strong results for high-resource languages, weak for lower-resource. The compute statement is already one such finding.

OpenEuroLLM is one valid answer to the European sovereign-LLM question. AMÁLIA is another. Minerva is a third. Mistral is potentially a fourth — the commercial-frontier answer this essay track examines next. The strategic discourse benefits from treating all of them as complementary experiments in the same empirical question. More analysis like this is needed. Not less.

— Standalone Essay 03 · The OpenEuroLLM case study · May 2026

Implications of Compute Bottlenecks for European AI Sovereignty

The ongoing challenges faced by OpenEuroLLM highlight a fundamental issue in Europe’s pursuit of sovereign AI: the scarcity of sufficient computational resources to train large models at scale. This bottleneck could delay or diminish the impact of the project, affecting Europe’s competitiveness in AI development. The project’s progress and eventual model quality will serve as a critical indicator of whether pooled European resources can effectively support the next generation of multilingual AI models, shaping future policy and investment decisions.

European Sovereign-LLM Strategies and Resource Challenges

European efforts to develop sovereign large language models have taken three main paths: Italy’s Minerva, Portugal’s AMÁLIA, and the consortium-based OpenEuroLLM. Minerva is a national from-scratch investment, while AMÁLIA is a national project built by continued pre-training of a EuroLLM-derived model. The OpenEuroLLM project represents a collective pooling of resources across multiple countries and institutions, aiming to overcome individual resource constraints.

Launched in early 2025, OpenEuroLLM is part of a broader European strategy to foster independent AI capabilities, reduce reliance on US and Chinese models, and promote multilingual, open-source solutions. Despite initial progress, the project’s lead publicly acknowledged that resource limitations—particularly compute capacity—are a significant hurdle, a challenge common across all three approaches.

Previous projects like Minerva and AMÁLIA have demonstrated the difficulty of scaling models within limited national resources, with early findings indicating that achieving competitive performance remains challenging at current scale levels. The upcoming first models from OpenEuroLLM will be a key test of whether pooled European resources can meet the demands of large-scale multilingual model training.

“Significant challenges, especially in securing more compute for creating the final models, still remain.”

— Jan Hajič, Charles University

Unresolved Questions About Model Performance and Resources

It is still unclear whether the consortium will secure enough compute for the final models, due January 31, 2028, and whether the first models will ship on schedule on July 31, 2026. The actual performance and utility of these models once released remain to be seen, and the impact of ongoing resource constraints on model quality is uncertain. Additionally, the future participation of key industry players like Mistral remains unconfirmed, which could influence the project’s scope and success.

Upcoming Model Releases and Resource Allocation Decisions

The next critical milestone is the July 31, 2026, release of the first models. The project team will need to demonstrate that they can overcome current compute limitations to deliver functional, multilingual models. Further developments will depend heavily on whether additional funding or resource allocation can be secured, and how effectively the consortium manages existing constraints.

Key Questions

What is the main goal of OpenEuroLLM?

OpenEuroLLM aims to develop open-source, multilingual large language models for public use, leveraging pan-European resources and collaboration.

What are the main challenges faced by the project?

The primary challenge is securing enough computational resources to train the models at the desired scale and quality, which could impact the project timeline and outcomes.

How does OpenEuroLLM compare to national projects like Minerva and AMÁLIA?

Unlike Minerva and AMÁLIA, which are national efforts, OpenEuroLLM pools resources across multiple countries and institutions to address resource constraints collectively.

Will the first models be ready by July 2026?

That is the current scheduled deadline, but progress depends on overcoming resource limitations. The models’ performance and quality will be revealed upon release.

What impact could resource limitations have on European AI sovereignty?

If resource constraints delay or diminish the models’ quality, Europe’s ability to develop independent, competitive AI solutions could be compromised, influencing future strategic autonomy.

Source: ThorstenMeyerAI.com
