Red Hot Cyber
Cybersecurity is about sharing. Recognize the risk, combat it, share your experiences, and encourage others to do better than you.
Cybersecurity is about sharing. Recognize the risk,
combat it, share your experiences, and encourage others
to do better than you.
Redhotcyber Banner Sito 320x100px Uscita 101125
LECS 970x120 1
GPT-5 is here! The billion-dollar AI that devours GPUs and makes GPT-4 look like a toy.

GPT-5 is here! The billion-dollar AI that devours GPUs and makes GPT-4 look like a toy.

9 August 2025 08:55

OpenAI has launched its new flagship model, and the message is clear: reliability, power, and a radical change in the way we interact with artificial intelligence. After years of viewing language models as experimental tools, GPT-5 marks the definitive transition from prototype to production.

One of its strengths is the unified system with “smart router”: there’s no longer any need to manually choose which model to use. GPT-5 analyzes the request and autonomously decides whether to provide a rapid response or activate a more complex computation mode for challenging problems. A single interface, power on demand.

Fewer Hallucinations and More Reasoning

On the reliability front, progress is remarkable. OpenAI claims a reduction in hallucinations of between 45% and 80% compared to previous models, the result of targeted engineering work. This is a crucial step towards making AI a practical, safe, and predictable work tool.

Performance on key tasks confirms the leap in quality: in coding, GPT-5 achieves a 74.9% score on SWE-bench, placing it among the top performers in software engineering; in scientific reasoning, it shows significant progress in mathematics (AIME), science (GPQA), and multimodal perception; In writing, it demonstrates superior management of structure, rhythm, and metaphor, going far beyond simple grammatical correctness.

Finally, access and costs: GPT-5 is already available in ChatGPT (replacing GPT-4o) and via API in various formats (-5, mini, nano) with 256k input and 128k output context. The rates, up to $10 per million output tokens and $1.25 per million input, are extremely competitive, in line with models like Google Gemini 2.5. But the real breakthrough isn’t just power: it’s trust. The age of toys is over—this AI is designed to work.

OpenAI presented GPT-5 as an infrastructure model: reliable, powerful, and designed for professional use. Now let’s add a surprising aspect: its development would have required a billion-dollar investment and unprecedented hardware infrastructure, confirming that this is no longer a prototype, but a full-fledged undertaking.

Record figures for computing power, parameters, and therefore consumption

GPT-4 is estimated to have cost around $63 million to train, using 25,000 Nvidia A100 GPUs for 90-100 days. By comparison, estimates for GPT-5 suggest a cost of $1.25 to $2.5 billion and an infrastructure of 250,000 to 500,000 dedicated Nvidia H100 GPUs. Alternatively, according to other calculations based on market estimates (parameters of 8.8-15 trillion and training durations around 150 days), between 150,000 and 260,000 H100 GPUs would be needed, for a hardware cost of between $3.7 and $6.6 billion just to purchase the cards.

While official details on GPT-4’s parameter count have not been made public, sources estimate it could reach 1.8 trillion parameters. GPT-5, on the other hand, is projected to have between 8.8 and 15 trillion parameters, which is about 5-8 times larger than its predecessor.

The required infrastructure is not only expensive, but It’s also logistically challenging: hundreds of thousands of H100 GPUs represent a massive demand for both energy and physical space. If GPT-4 had already marked a turning point in resource use, GPT-5 pushes it to near-industrial levels, effectively embodying the definitive shift from experimentation to AI infrastructure.

From a practical standpoint, GPT-5 promises far greater power and reliability than GPT-4. However, the high cost and enormous hardware requirements shift the focus: we are no longer in the realm of experimentation, but in a new era in which AI is infrastructure—expensive, widespread, and ready to serve mission-critical applications. The age of toys is over: AI is now designed for work.

Follow us on Google News to receive daily updates on cybersecurity. Contact us if you would like to report news, insights or content for publication.

Immagine del sito
Redazione

The editorial team of Red Hot Cyber consists of a group of individuals and anonymous sources who actively collaborate to provide early information and news on cybersecurity and computing in general.