Open-Sourced AI Models May Be More Costly in the Long Run, Study Finds

As more businesses adopt AI, choosing a model is a major decision. While open-source models may seem cheaper initially, a new study warns that those savings can evaporate quickly because of the extra computing power they require.

In fact, open-source AI models burn through significantly more computing resources than their closed-source rivals when performing the same tasks, according to a study published Thursday by Nous Research.

The researchers tested dozens of AI models, including closed systems from Google and OpenAI, as well as open-source models from DeepSeek and Magistral. They measured how much computing effort each required to complete identical tasks across three categories: simple knowledge questions, math problems, and logic puzzles.

To do this, they used the number of tokens each model needed to work through and answer a question as a proxy for the computing resources consumed.

“Open-weight models use 1.5–4× more tokens than closed ones—and up to 10× for simple knowledge questions—making them sometimes more expensive per query despite lower per-token costs,” the study authors wrote.

Why token efficiency matters

In AI, a token is a piece of text or data—it could be a word, part of a word, or even punctuation—that models use to understand language. Models process and generate text one token at a time, so the more tokens they use, the more computing power and time a task requires.

Since most closed-source models don’t reveal their raw reasoning process or chain of thought (CoT), the researchers measured their computing efficiency by counting the tokens they used instead. Because models are billed for the total output tokens they generate, covering both the reasoning process and the final answer, completion tokens serve as a proxy for the effort needed to produce a response.

This is an important consideration for companies using AI, for two main reasons.

“First, while hosting open weight models may be cheaper, this cost advantage could be easily offset if they require more tokens to reason about a given problem,” the researchers wrote. “Second, an increased number of tokens will lead to longer generation times and increased latency.”
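
The trade-off the researchers describe comes down to simple arithmetic: cost per query is completion tokens multiplied by price per token. The sketch below illustrates that relationship in Python. The prices and the 500-token baseline are hypothetical values chosen for illustration, not figures from the study, and the function names are assumptions as well; only the token multipliers (1.5x, 4x, 10x) come from the quoted findings.

```python
def cost_per_query(completion_tokens: int, price_per_million_tokens: float) -> float:
    """Cost in dollars of one query, given tokens generated and the per-million-token price."""
    return completion_tokens * price_per_million_tokens / 1_000_000


def break_even_token_ratio(closed_price: float, open_price: float) -> float:
    """Token multiplier at which the open model's per-query cost equals the closed model's."""
    return closed_price / open_price


# Hypothetical prices in dollars per million output tokens (not from the study).
CLOSED_PRICE = 10.0
OPEN_PRICE = 4.0
# Hypothetical number of completion tokens the closed model uses for one query.
CLOSED_TOKENS = 500

closed_cost = cost_per_query(CLOSED_TOKENS, CLOSED_PRICE)
print(f"closed model: ${closed_cost:.4f} per query")

# Token multipliers quoted in the study: 1.5x to 4x, and up to 10x.
for ratio in (1.5, 4.0, 10.0):
    open_tokens = int(CLOSED_TOKENS * ratio)
    open_cost = cost_per_query(open_tokens, OPEN_PRICE)
    verdict = "cheaper" if open_cost < closed_cost else "more expensive"
    print(f"open model at {ratio}x tokens: ${open_cost:.4f} per query ({verdict})")

print(f"break-even multiplier: {break_even_token_ratio(CLOSED_PRICE, OPEN_PRICE)}x")
```

Under these assumed prices, the open model stays cheaper only while it uses fewer than 2.5 times the closed model's tokens. Above that multiplier, the lower per-token price no longer offsets the extra tokens.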

Closed models were the clear winners

The study found that open models consistently use more tokens than closed models for the same tasks, sometimes three times as many for simple knowledge questions. For math and logic problems, the gap narrowed to less than double.

“Closed models (OpenAI, Grok-4) optimize for fewer tokens to cut costs, while open models (DeepSeek, Qwen) use more tokens, possibly for better reasoning,” the study authors wrote.

Among open models, llama-3.3-nemotron-super-49b-v1 was the most efficient, while Magistral models were the least efficient.

OpenAI’s models were standouts as well. Both its o4‑mini and the new open-weight gpt‑oss models showed impressive token efficiency, especially on math problems.

The researchers noted that OpenAI’s gpt‑oss models, with their concise chains of thought, could serve as a benchmark for improving token efficiency in other open models.
