Türkiye’s AI model Cosmos T1 outperforms larger global models in Turkish math reasoning
A domestically developed artificial intelligence model in Türkiye has achieved a major milestone in Turkish-language mathematical reasoning, outperforming significantly larger international systems in benchmark testing.
Cosmos T1, a 9-billion-parameter AI model developed by the Cosmos research team at Yıldız Technical University (YTU), reached a 77.41% accuracy rate on the GSM8K benchmark, a widely used test that measures mathematical reasoning capabilities in large language models.
According to the research team, the model demonstrated superior performance compared to substantially larger systems, including Meta’s 70-billion-parameter Llama-3.1-70B, which recorded 66.13% accuracy, and Google’s Gemma-2-9B model, which achieved 63.10% accuracy on the same test.
The results indicate that Cosmos T1 is capable of competing with models nearly eight times its size in parameter count, particularly in Turkish-language reasoning tasks.
The project was led by Prof. Dr. Mehmet Fatih Amasyalı of YTU’s Department of Computer Engineering and the Department of Artificial Intelligence and Data Engineering. Researchers stated that Cosmos T1 distinguishes itself through advanced Turkish-language “chain-of-thought” reasoning, enabling the model to articulate intermediate reasoning steps before generating final answers.
Amasyalı noted that the model was developed based on Google’s Gemma 2 architecture, with extensive enhancements focused on Turkish linguistic performance. Unlike conventional question-answer systems that generate direct outputs, Cosmos T1 operates in a two-stage process: structured reasoning followed by answer generation.
The model has been released as open-weight, allowing institutions and companies to download and deploy it on-premise. This approach is considered particularly valuable for sectors with strict data governance requirements, such as healthcare and defense, where external data sharing may be restricted.
Users can access Cosmos T1 either through an online platform or by integrating the downloadable model weights into their own infrastructure. The development process was supported under TÜBİTAK-funded research projects, including financial backing for Turkish-language optimization.
Researchers stated that the model underwent intensive training to transition from a conventional “non-reasoning” architecture to a reasoning-enabled system, a transformation that significantly improved its performance. The achievement has also generated considerable attention on social media platforms.
The advancement marks a notable step in Türkiye’s expanding AI ecosystem and highlights the growing role of localized language optimization in global AI competitiveness.(ILKHA)
LEGAL WARNING: All rights of the published news, photos and videos are reserved by İlke Haber Ajansı Basın Yayın San. Trade A.Ş. Under no circumstances can all or part of the news, photos and videos be used without a written contract or subscription.
A new assessment published in the journal Nature has raised concerns that advances in artificial intelligence could lower barriers to the development of biological weapons, as AI-powered biotechnology tools become increasingly capable of designing and analyzing complex biological systems.
China launched a new communication test satellite on Thursday, marking a key step in the country’s efforts to advance next-generation orbital communication technologies.
Scientists have discovered what is believed to be the world's deepest and largest collection of whale remains on the floor of the southeastern Indian Ocean, uncovering both ancient fossils and active whale-fall ecosystems that have existed for at least 5.3 million years.
Safety concerns surrounding the V-BAT military drone developed by U.S.-based defense technology company Shield AI have resurfaced after a Romanian naval officer suffered severe injuries during a training exercise in the United States.