Page 23 - EE Times Europe Magazine – November 2023
P. 23
EE|Times EUROPE 23
The Mind-Boggling Cost of Generative AI Ownership
The latest algorithms pose
Attributes Google Search GPT-3 GPT-4
a challenge to the current
Cost Per Query ¢0.2 ~¢3 ~¢10
state-of-the-art processing
Table 2: Comparing the cost per query of GPT-3 and GPT-4 against Google search shows hardware, and GenAI
the leap in cost associated with GPT-4. (Source: Vsora)
accelerators are not
keeping up.
• Theoretical throughput of one (25 W / 0.007 queries per second)
leading-edge GenAI system processing • Total energy cost: US$0.11 per kWh
ChatGPT-4: ~0.055 queries/second • Energy cost per query: US$1.2 –4 processing hardware, and GenAI accelerators
• Number of systems needed to meet a • Total power consumption for are not keeping up. In fact, no hardware on
processing capability of 100,000 queries/second: ~363.7 MW the market today is capable of running the
100,000 queries/second: The energy cost amounts to about full GPT-4.
~1,800,000 (100,000 / 0.055) US$1 million per day (power consumption for Current LLM development efforts that focus
• Total acquisition cost: the chips × 24 hours × 0.11). on creating smaller but more specialized LLMs
~US$900,000,000,000 Clearly, the cost is dominated by hardware that can run on existing hardware are a diver-
(1,800,000 × 500,000), approaching acquisition. sion. The GenAI industry needs semiconductor
US$1 trillion innovations in computing methods and archi-
The daily depreciation amounts to about The best-guess total daily cost is in the ball- tectures capable of delivering performance of
~US$820 million (900,000,000,000 / 1,095). park of US$820 million. multiple petaFLOPS with efficiency greater than
The above leads to a GPT-4 cost per query for 50%, reducing latency to less than 2 seconds per
Estimated energy costs to execute the a system running 100,000 queries/second of query, constraining energy consumption and
hardware $US0.095 (820,000,000 / (100,000 × 24 × 60 × 60) shrinking cost to $US0.002 per query.
Assumptions: [(cost per day) / (# of queries × # of hours × # of Once this is in place—and it is only a
• Average power consumption per chip: seconds)]. Table 2 compares costs per query. matter of time—the promise of transformers’
25 W, based on nominal power, efficiency deployment on edge devices will be fully
and memory bandwidth FULFILLING THE PROMISE OF GenAI AT exploited. ■
• Throughput per chip: THE EDGE
~0.007 queries/second (0.055 / 8) The latest algorithms, such as GPT-4, pose Lauro Rizzatti is a business adviser for
• Energy consumption per query: 3,637 J a challenge to the current state-of-the-art Vsora.
GaN & SiC: Design,
Devices and the
Evolving Market
Explore the future of
power electronics with our
comprehensive report on
wide-bandgap semiconductors:
SiC and GaN.
DOWNLOAD THE EBOOK:
https://www.powerelectronicsnews.com/
special-report-gan-sic-worldwide/