Microsoft cancels internal Anthropic licenses amid exploding costs. Practical guide for SMEs/mid-market companies: anticipate and control your AI budget.


In May 2025, Microsoft made a decision that shook the enterprise AI world: the outright cancellation of its internal Anthropic licenses. The reason? The shift to token-based billing caused annual budgets to explode in just a few months. Teams that had planned a comfortable AI budget for the year found themselves depleted by the third quarter.
This situation is not isolated. It reveals a structural problem that many French and Belgian SMEs and mid-sized companies are beginning to discover: the costs of using generative AI are largely underestimated at deployment time. Between commercial promises and billing reality, the gap can reach 300 to 500% depending on usage.
This article gives you the keys to understand this phenomenon, anticipate budget overruns, and implement concrete optimization strategies. Because abandoning AI is not an option, but suffering financially from it isn't either.
Most generative AI providers now charge based on usage, calculated in tokens. A token represents about 4 characters in English, less in French due to accents and special characters. Concretely, a simple 50-word question generates about 70 input tokens. The response can contain 500 to 2,000 depending on the complexity requested.
The calculation quickly becomes staggering:
An employee who uses AI 50 times per day for routine tasks can generate 500,000 tokens monthly. Multiply by 100 employees and you reach 50 million tokens per month. The monthly bill then easily exceeds EUR 5,000 for a single use case.
Beyond direct billing, several cost items fly under the radar during budget projections:
According to available information, Microsoft reportedly found that some internal teams had consumed their entire annual AI budget in less than four months. Intensive usage related to software development, code analysis, and technical documentation reportedly generated token volumes far beyond initial projections.
This mishap by a technology giant should serve as a lesson: if Microsoft, with all its expertise, was caught off guard, an SME or mid-sized company without a dedicated cloud cost management team faces an even greater risk.
Before seeking solutions, you need to measure the scope of the problem. Here's the information to collect as a priority:
At AISOS, we observe that consumption varies drastically by function. A typical audit reveals this breakdown:
For each identified usage, establish a unit cost. Example for generating a blog article:
This calculation, repeated for each process, helps identify major expense items and optimization opportunities.
Not all use cases require the most powerful model. A tiered approach can reduce costs by 40 to 60%:
Implementing an intelligent router that automatically directs queries to the appropriate model represents an initial investment that pays for itself in a few weeks.
A well-designed prompt consumes less and produces better results. Key principles:
Companies have reduced their consumption by 30% simply by training their teams in effective prompt writing.
Without control, usage naturally drifts. Put in place:
The market now offers competitive options:
A reference document should specify:
Even on a small scale, centralizing expertise allows you to:
One person dedicated 20% of their time can generate savings greater than their cost.
A use case is only justified if it creates more value than it costs. For each AI application, document:
AISOS audits regularly reveal that 20 to 30% of enterprise AI uses have negative or unmeasurable ROI. Eliminating them frees up budget for truly value-creating cases.
Good news: competition is driving prices down. Between 2023 and 2025, the average cost per token was divided by 5 for equivalent performance. This trend should continue, but with nuances:
Rather than an expensive generalist model, models fine-tuned for specific tasks will offer better performance-price ratios. Investing in customizing open source models for your recurring use cases becomes a relevant strategy.
The European AI Act is gradually coming into effect. Requirements for documentation, auditing, and traceability will generate additional costs. Better to anticipate them in your 2026 budget projections.
The Microsoft-Anthropic episode is not an anecdote: it's the signal that the era of "unlimited" AI is over. Companies that will thrive will be those that control their AI costs while extracting maximum value from these technologies.
Priority actions to launch right now:
Mastering AI costs is not a brake on innovation: it's the condition for its sustainability. Leaders who understand this in 2025 will be those who maintain an edge in 2026 and beyond.
To precisely assess your exposure to AI budget drift risks and identify your optimization levers, contact AISOS teams for a personalized diagnosis.