13 Jun 2023
The increasing use of AI has positive implications for the semiconductor sector as the need for compute power and capital expenditure intensifies, reasons Portfolio Manager Richard Clode.
“The computer industry is going through two simultaneous transitions — accelerated computing and generative AI. A trillion dollars of installed global data center infrastructure will transition from general purpose to accelerated computing as companies race to apply generative AI into every product, service and business process.”
Jensen Huang, nVIDIA CEO
nVIDIA’s recent Q1 results and positive Q2 revenue guidance1 and subsequent stock reaction and positive news flow put the market-leading chip designer’s valuation in the rarefied air of $1 trillion market capitalisation. We think this is an appropriate time to reflect on the state of artificial intelligence (AI), its development and progress, as well as how to invest in companies that are benefiting from this critical juncture for technology. nVIDIA highlighted several salient themes that are noteworthy for active, long-term technology investors like us.
Migration to the public cloud is aggregating compute for hyperscalers on a scale not witnessed before
For decades, datacentres were the preserve of processors designed on the x86 architecture, dominated by Intel and more recently AMD. However, there are two major inflection points occurring. Firstly, the migration to the public cloud aggregates compute (computation and processing) at the hyperscalers on a scale not witnessed before. That scale, combined with the resources and technology acumen of those companies is leading them to move in two complementary directions as they focus on processing those cloud workloads more efficiently, notably around reducing power consumption, given this is one of the major costs for datacentres.
Source: Janus Henderson Investors, as at 31 May 2023. NVIDIA as at 31 May 2022. For illustrative purposes and not indicative of any actual investment.
Cloud acceleration leverages the parallel processing capabilities of graphics processing units (GPUs) or field programmable gate arrays (FPGAs) to offload compute from the central processing unit (CPU) to more power-efficient processors for the workload at hand. At the same time, hyperscalers are adopting Arm processors bringing their low-power processing, demonstrated for years in smartphones, to the datacentre via internally-designed custom semiconductors for example at Amazon, start-ups like Ampere, or the new Grace CPU launching later this year from nVIDIA.
The cloud as an enabler for AI
The second major inflection is the shift in compute workloads within the cloud to become AI centric. That is a very different workload to the traditional consumer internet workflow and consequently requires a different compute and datacentre design. So we are currently witnessing the hyperscalers speedily pivot their capital expenditure (capex) spending to the new AI age. AI training has aways utilised GPUs but generative (creation of new content) AI is also much more compute intensive on the inferencing side (running data points into an algorithm to calculate output) as well. A ChatGPT response to a query is much more computationally intensive than a Google keyword search. While Google has designed internal AI inferencing chips, to date a lot of AI inferencing had been done on x86 CPUs but that is not possible anymore for performance and cost reasons. That is also driving a shift to GPUs and custom silicon more tailored to this radically-different workload.
Implications for capital spending
The combination of the above is dramatically changing where the hyperscalers are spending their capex budgets. This is reflected by nVIDIA’s guidance for Q2 datacentre sales that were almost $4bn ahead of market expectations. To put this in perspective, Intel’s first quarter datacentre revenues were around $4bn and it made its first ever loss having lost market share and margins to AMD. Getting these tectonic shifts right can define investment returns.
Chip innovation is ramping up to provide better performance and power
As we look ahead we believe we are in the early stages of some significant wider semiconductor industry inflections. For years the exponential cost increase of maintaining Moore’s Law led to a rapidly shrinking pool of customers willing to pay up for bleeding edge (new and not fully tested) semiconductors. That trend is now reversing as AI performance requirements drive more customers to seek out the best performance and power. Taiwan Semiconductor Manufacturing Company (TSMC) has said it has twice the tape outs (chip designs ready for fabrication) on their current 3nm-class manufacturing process compared to the prior node. Many of those tape outs will be custom silicon designs from the hyperscalers. The complexities of creating such large, powerful chips is testing Moore’s Law to the limit, creating the ‘More than Moore’ trend and we are seeing incredible innovation here. The new MI300 from AMD launching later this year integrates multiple CPU and GPU chiplets, as well as high bandwidth memory via a new technology called hybrid bonding, pioneered by Besi.
Ultimately, as generative AI scales out, not all the compute can be done in centralised datacentres, therefore increasingly inference has to be done locally on edge devices, has lower latency (delays) and is better able to protect personal data. Qualcomm is currently demonstrating the ability to inference Meta’s LLaMA large language model on a smartphone. nVIDIA has also demonstrated that the complexity of AI requires a full stack (complete) solution so the innovation will not just be in hardware but also in software, for example its new Hopper chips have a transformer software engine that intelligently balances the trade-off between computational precision and accuracy to maximise the speed AI models can be trained.
Summing it all up, we believe the next major compute wave is upon us with the inflection in generative AI and the tectonic trends outlined above will create a wealth of broader investment opportunities over time across a number of companies that are well positioned to benefit.
1 nVIDIA financial results first quarter fiscal year 2024, announced 24 May 2023.
Important information
The views presented are as of the date published. They are for information purposes only and should not be used or construed as investment, legal or tax advice or as an offer to sell, a solicitation of an offer to buy, or a recommendation to buy, sell or hold any security, investment strategy or market sector. Nothing in this material shall be deemed to be a direct or indirect provision of investment management services specific to any client requirements. Opinions and examples are meant as an illustration of broader themes, are not an indication of trading intent, are subject to change and may not reflect the views of others in the organization. It is not intended to indicate or imply that any illustration/example mentioned is now or was ever held in any portfolio. No forecasts can be guaranteed and there is no guarantee that the information supplied is complete or timely, nor are there any warranties with regard to the results obtained from its use. Janus Henderson Investors is the source of data unless otherwise indicated, and has reasonable belief to rely on information and data sourced from third parties. Past performance does not predict future returns. Investing involves risk, including the possible loss of principal and fluctuation of value.
Not all products or services are available in all jurisdictions. This material or information contained in it may be restricted by law, may not be reproduced or referred to without express written permission or used in any jurisdiction or circumstance in which its use would be unlawful. Janus Henderson is not responsible for any unlawful distribution of this material to any third parties, in whole or in part. The contents of this material have not been approved or endorsed by any regulatory agency.
Janus Henderson Investors is the name under which investment products and services are provided by the entities identified in Europe by Janus Henderson Investors International Limited (reg no. 3594615), Janus Henderson Investors UK Limited (reg. no. 906355), Janus Henderson Fund Management UK Limited (reg. no. 2678531), Henderson Equity Partners Limited (reg. no.2606646), (each registered in England and Wales at 201 Bishopsgate, London EC2M 3AE and regulated by the Financial Conduct Authority) and Janus Henderson Investors Europe S.A. (reg no. B22848 at 2 Rue de Bitbourg, L-1273, Luxembourg and regulated by the Commission de Surveillance du Secteur Financier).
Outside of the U.S.: For use only by institutional, professional, qualified and sophisticated investors, qualified distributors, wholesale investors and wholesale clients as defined by the applicable jurisdiction. Not for public viewing or distribution. Marketing Communication.
Janus Henderson, Knowledge Shared and Knowledge Labs are trademarks of Janus Henderson Group plc or one of its subsidiaries. © Janus Henderson Group plc.