<\/i>","library":"fa-solid"},"layout":"horizontal","toggle":"burger"}" data-widget_type="nav-menu.default">

Top

Counterpoint Research Weekly Newsletter

Weekly Newsletter
September 14, 2023

What Were the Biggest Changes Coming With iPhone 15?

Sign Up for Our Newsletter Here

Receive our insightful weekly newsletter and stay ahead of the competition.

Fill in the form to subscribe to our newsletter.

Meet Counterpoint at DTW23 Ignite

Counterpoint Research is pleased to announce its participation as a Media Partner inDTW23 Ignite我们的副主任,Mohit Agrawal, will be present. You can schedule a meeting with him to discuss the latest trends in the IoT sector and understand how our leading research and services can help your business.

[one_half填充= " 0 0 px 0 0 "]

When:19-21 September, 2023

[/one_half]

[one_half_last padding=”0 0px 0 0″]

Where:Copenhagen

[/one_half_last]


[two_third padding=”0 20px 0 0″]

Click here(or send us an email at contact@www.arena-ruc.com) to schedule a meeting with him.

[/two_third]

[one_third_last]

SCHEDULE A MEETING

[/one_third_last]


About the Event:

The telco and cloud worlds are colliding, driving rapid service innovation, upending traditional business models, and ushering in exciting new opportunities for growth. This groundbreaking transformation is being enabled by AI, autonomous networks and open digital architectures.

TM Forum is leading the industry in defining the building blocks for new operating models, impactful new partnerships, and advanced software platforms, unlocking the value of data to create nearly endless opportunities for players across the communications ecosystem.

DTW23 – Ignite will harness the collective ingenuity, innovation, and collaboration of global community to seize the opportunity and lead the communications industry to growth.

Read more about DTW23 Ignite

相关的帖子

Podcast #69: ChatGPT and Generative AI: Differences, Ecosystem, Challenges, Opportunities

Generative AI has been a hot topic, especially after the launch of ChatGPT by OpenAI. It has even exceeded Metaverse in popularity. From top tech firms like Google, Microsoft and Adobe to chipmakers like Qualcomm, Intel, and NVIDIA, all are integrating generative AI models in their products and services. So, why is generative AI attracting interest from all these companies?

而生成AI和ChatGPT都用于generating content, what are the key differences between them? The content generated can include solutions to problems, essays, email or resume templates, or a short summary of a big report to name a few. But it also poses certain challenges like training complexity, bias, deep fakes, intellectual property rights, and so on.

In the latest episode of ‘The Counterpoint Podcast’, hostMaurice Klaehneis joined by Counterpoint Associate DirectorMohit Agrawaland Senior AnalystAkshara Bassito talk about generative AI. The discussion covers topics including the ecosystem, companies that are active in the generative AI space, challenges, infrastructure, and hardware. It also focuses on emerging opportunities and how the ecosystem could evolve going forward.

Click to listen to the podcast

Click here to read the podcasttranscript.

Podcast Chapter Markers

01:37 –Akshara on what is generative AI.

03:26 –Mohit on differences between ChatGPT and generative AI.

04:56 –Mohit talks about the issue of bias and companies working on generative AI right now.

07:43 –Akshara on the generative AI ecosystem.

11:36 –Akshara在中国公司正在做的事情AI space.

13:41 –Mohit on the challenges associated with generative AI.

17:32 –Akshara on the AI infrastructure and hardware being used.

22:07 –Mohit on chipset players and what they are actively doing in the AI space.

24:31 –Akshara on how the ecosystem could evolve going forward.

Also available for listening/download on:

apple podcasts logo spotify tune in radio google podcasts logo

AI Chip Market: Advanced Packaging Capabilities Key Differentiating Factor

  • The requirement of computing power, latency and higher bandwidth can be met with the combination of advanced packaging and wafer fabrication techniques.
  • One of the integration techniques widely used is CoWoS (chip-on-wafer-on-substrate), a 2.5D IC integration technology that integrates logic computing and HBM chips.
  • As AI becomes more pervasive across devices and industries, vertical players such as Apple, Tesla, Google and Meta are adding tons of AI capability in their offerings and would also demand integrated AI compute for their portfolios. CoWoS seems the best bet for now.

Advanced packaging offers a good lever to enhance overall chip performance beyond traditional geometric scaling on transistors, and extend Moore’s Law over the next decade. Advanced packaging is categorized as front-end 3D which stacks chips or wafers vertically and backend 2.5D CoWoS that interconnects dies horizontally via RDL (redistribution layer) or interposer.

HBM going mainstream with demand for AI applications

The requirement of computing power, latency and higher bandwidth can be met with the combination of advanced packaging and wafer fabrication techniques, especially for high-performance computing chips.AIaccelerators used to train AI models in data centers require the highest memory bandwidth available. Unlike earlier systems, which were memory-constrained, current data center architectures use a variety of techniques to overcome memory bottlenecks. One of the solutions widely implemented to boost bandwidth and memory capacity is high bandwidth memory (HBM). AI has been a big driver of HBM inGPUseven though traditionally it has been a niche technique.

HBM technology works by vertically stacking DRAM chips on top of one another and interconnected through TSVs (Through Silicon Vias) and micro-bumps. For AI training and high-performance applications, HBM can deliver terabytes per second which is significant processing power required for AI and high-performance computing applications as the DRAMs are stacked. While HBM offers extreme bandwidth for the off-chip memory needed for data center AI accelerators, tradeoffs include HBM’s cost and thermal limitations. HBM is becoming more mainstream, with advances in the area, performance improvement and reduced power consumption with every iteration, as the DRAM stack and theSoCare placed in a single package substrate, driving its adoption in AI applications.

CoWoS major driver for packaging advanced logic with HBMs

One of the integration techniques widely used is CoWoS (chip-on-wafer-on-substrate), a 2.5D IC integration technology that integrates logic computing and HBM chips by mounting them on a silicon interposer and then placing them directly on a packagesubstrate. TSV/RDL interposers are for extremely fine-pitch, high-I/O, high-performance and high-density semiconductor IC applications.

The logic and HBMs are first bonded side by side on the silicon interposer to form chip-on-wafer(CoW) with fine pitch and high-density interconnect routing among the devices. Each HBM consists of DRAMs with micro-bumps and a logic base with TSVs straight through them. Through-Silicon Via (TSV) is the feature that enables 2.5D and 3D advanced packaging. TSVs are electrical connection pathways that are short vertical columns running through the silicon wafer or die and enable smaller package sizes and denser interconnects, improve electrical performance by shortening the electrical distance traveled, and enable stacking of multiple chips used in products like HBM. Finally, the TSV interposer’s assembly with larger bumps on a package substrate is done.

AI chip market - CoWoS integration process
Source: Counterpoint Research

CoWoS also a bottleneck for industry

Over the years, CoWoS technology development has focused on supporting increasingsiliconinterposer dimensions to support processors and HBM stacks in the overall package. Today, CoW with C2 bump by TCB method is the most used assembly method for silicon-to-silicon flip chip bonding. CoW flip chip by a bumpless technique called the hybrid bonding method, currently in R&D, will gain traction over the years.

CoWoS has been the main bottleneck for AI shipments recently. Increasing CoWoS capacity to fulfill the rising demand, driven byNVIDIA,AMD,BroadcomandAmazon, has been the focus.

Addressing challenges to CoWoS growth

To address these challenges, existing players need to diversify the supply chain by increasing capacity and collaboration with players in the value chain – OSATs, interposer manufacturers, and packaging equipment vendors.

What should pure-play foundries do?

TSMC has capacity constraints with respect to CoWoS creating some bottlenecks for NVIDIA and other customers. One of the reasons here is the bottleneck from equipment suppliers and manufacturing interposers. Therefore,TSMCshould look to collaborate and potentially outsource to its partners, which will help stabilize volume in the near- to mid-term and meet the growing demand for advanced packaging in AI and HPC applications. So, long term, TSMC should collaboratively work with equipment manufacturers to optimize the CoWoS processing techniques such as TCB (Thermal Compression Bonding) and hybrid bonding.

What should equipment vendors do?

Equipment players should collaborate closely with foundries and co-develop optimized processes to scale the CoWoS packaging. For example,AppliedMaterials is actively developing new technologies for hybrid bonding and TSV to advance heterogeneous chip integration that help chipmakers integrate chiplets into advanced 2.5D and 3D packages. Similar collaborations would be expected from other players such as KLA Tencor, Lam Research, ASMPT and BESI to drive advancements in CoWoS packaging or other competing techniques such as hybrid bonding techniques to enable improvements in fine pitch, I/O density and power consumption. Research collaborations will provide semiconductor and systems companies with a complete suite of tools and technologies for developing and prototyping various packaging designs and enable continued advances in power, performance, area, cost and time-to-market.

What should IDMs do?

Samsung and other IDMs are also actively looking to take a share of the advanced packaging market and enhance their design and manufacturing capabilities to establish a sufficient customer base for high-end solutions. For example, Samsung Electronics’ HBM-PIM technology integrates processing chips and advanced memory chips like HBM with its proprietary 2.5D or 3D packaging technology. Samsung has developed HBM-PIM as a potential replacement for its existing HBM2 product where the bottom four of eight memory chips are replaced with chips containing both DRAM and compute cores. By doing some of the compute in the DRAM, the volume of data that needs to go to the processor decreases, effectively speeding up the neural network and saving the power needed to transport data. As this technology develops, Samsung will be able to leverage its advanced packaging solutions and technology as it is the only company that has a chip business portfolio spanning memory chips to contract-based processing chip fabrication and packaging and will be able to increase its market share and compete with TSMC.

Advanced packaging is also an opportunity for IDMs that have mature node capability to enhance the performance of their existing product portfolios and capture significant market share.

What should OSATs do?

OSATs like ASE and Amkor are actively investing to expand advanced packaging offerings but it will take time to benefit from the uptrend. Yield and volume output are the keys for foundries to grow their advanced packaging platform. This also poses a barrier for most traditional OSAT companies to enter this area. However, they could collaborate with leading players and increase their capacity in back-end processes. Another option could be to increase investment in R&D for advanced packaging applications in mature nodes, which will help prepare for the next upturn, like moving away from flip chip and wire bonding.

OSATs will face rough weather if they don’t join the wave of advanced packaging to achieve PPA benefits in mature nodes. But they can get a first-mover advantage if they start investing now, especially inAIapplications in theIoT,automotiveand network segments.

What should fabless vendors do?

NVIDIA accounts for the majority of the current CoWoS capacity. Getting the high performance required for AI workloads is possible because of the benefits being derived from CoWoS packaging. The technology offers flexibility in terms of a wide range of interposer sizes, package sizes and a number of HBM cubes, and provides the best performance with the highest integration density required for high-performance computing applications.

As AI becomes more pervasive across devices and industries, vertical players such as Apple, Tesla, Google and Meta are adding tons of AI capability in their offerings and would also demand integrated AI compute for their portfolios. CoWoS seems the best bet for now. As other competitors including Intel expand their offerings to capture the AI demand, they would have to match the best in class, which as of now is provided through CoWoS.

From the client compute perspective, as the industry develops scaled-down AI models capable of running on smartphones and users demand AI models on smartphones, it might experiment with CoWoS. However, implementing CoWoS packaging in smartphone applications at present will not be cost-effective and alternative packaging techniques will be a better option to consider.

Related links:

Artificial Intelligence: Irrational Exuberance is in Full Swing

ABF Substrate Demand Likely to Recover in H2 2023, BT Substrate Demand to Remain Weak

Currency Fluctuation Limits Global Wafer Fab Equipment Revenue Growth to 9% YoY in 2022

Wafer Fab Equipment Revenue Tracker

Applied Materials FY 2022 Earnings Report

Applied Materials Delivers Strong FY 2022 Numbers Despite Headwinds

Data Center CPU Market: AMD Surpasses Intel in Share Growth

Global Data Center CPU Revenue Tracker: Q1 2018 – Q4 2022

How Far Has Technology Come in 20 Years?

Stable Diffusion Image of mixed technology2
Source: Created with Stable Diffusion

Twenty years ago, I was an equity analyst for a Wall Street investment bank. At the time, my research director liked to get all the analysts to write occasional thought pieces. In the following article written in June 2003, I chose to write a speculative piece that looked back to 2003 from five years in the future, i.e. 2008. I speculated that there would be quite a few technological leaps in the five intervening years.

Given the 20 years that have now passed since I wrote the article, how many of those technologies have actually come into being? As you will see, not many, while others that were not foreseen have matured – for example, app-based smartphones and music streaming.

Without specifically naming it as artificial intelligence, I foresaw a role for cloud-based intelligent software agents that would provide intuitive assistance in multiple situations – a true digital assistant. These have not come into being and they are not even much discussed. We do have digital assistants such as Apple’s Siri, Google Assistant or Amazon’s Alexa, but they are mostly incapable of anything more than answering simple questions and certainly couldn’t be trusted to book travel tickets, make restaurant reservations or update other people’s diaries. WhileChatGPTand derivatives ofLarge Language Modelsseem superficially smarter, they are still not yet at the stage of being able to function as a general assistant.

One other technology referenced in the article that is still far from maturity, is augmented reality. The glasses described were not too far-fetched – Microsoft’s HoloLens can achieve some of what is described and Epson and Vuzix, for example, have developed glasses that are in use by field service engineers. But these products are not able to reference real-world objects.Apple’s forthcoming Vision Pro, while technically brilliant, would not be a suitable solution for the use case described.

At the end of the article, I listed companies that I expected to be playing a significant role in the development of the various technologies highlighted. But where are those companies now?

For context, and for the younger readers, around the turn of this century, third-generation cellular licenses had been expensively auctioned in several countries and many mobile operators were struggling to generate a return on their investment. Oh, how things have changed (or not)! As an analyst covering mobile technology, I could see that investors were valuing mobile operators solely on their voice and text revenues, with zero value being ascribed to future data revenues. My article was also an attempt to awaken investors to the potential value beyond voice.

Anyway, here’s the report that I wrote in mid-2003. It was written as though it were an article in a business newspaper.

Special Report – June 2008

Connected People

It is just eight years since European wireless telecom companies became the subject of outright derision for spending billions of dollars on licenses to operate third-generation cellular networks. Now the self-same companies have become core to our everyday existence. Their stock, which bottomed in the middle of 2002, has risen steadily ever since.

The original promise of 3G technology was high-speed data networking coupled with an exceptional capacity for both voice and data. But critics said that it was an innovation users didn’t need, want or would be willing to pay for.

When the first commercial 3G networks appeared in 2003 and faltered at the first step, the doubters started to look dangerously like they had a point. But the universe is fickle and within the last two or three years, the combination of maturing networks and the inevitable power of Moore’s Law has started to deliver wireless devices and applications that would have been thought of, if not as science fiction, then at least science-stretching-the-bounds-of-credibility, when the licenses were issued.

However, while the long-time infamy of 3G means it is taking the starring role as industry watchers chart the chequered history of the technology, it is the supporting cast of technologies that has really delivered the goods. Without them, 3G would have remained just another method to access the backbone network.

The following snapshots from one perfectly ordinary day last month show how the coordinated application of a whole slew of technologies has subtly but distinctly altered our lives.

Bristol – May 1, 2008, 12:57 pm

Beads of sweat form on the face of Jim McKenna, a 24-year-old technician, as he studies the guts of a damaged generator. McKenna is a member of a rapid response team, looking after mission-critical power generation facilities across Southern England.

“Dave, I’ve located the damaged circuits, I think I can repair it, but the control unit is non-standard and I’ve not seen one like it before. Can you help me out here?”

McKenna’s voice is picked up by a tiny transducer microphone embedded in a Bluetooth-enabled hands-free earbud. The bud is so small it nestles unobtrusively in the technician’s ear. The earbud is wirelessly connected to the small transceiver on McKenna’s belt. His voice activates a ‘push’-to-talk connection to his controller in the Scottish technical support center. The word push is in quotes because it is his voice that effects the push, leaving McKenna’s hands entirely free.

In the Edinburgh-based command center, David Sanderson, an experienced engineer, maximizes the image from one of a half-dozen sub-screens that compete for his attention. Each screen shows live pictures from his team of technicians with data about their location and degree of job completion.

Sanderson taps the screen again and, 400 miles away in Bristol, a tiny camera on McKenna’s smart glasses zooms in on the generator specification plate. Sanderson peers intently at the screen:

“我看到一个代码在侧板上。我已经强调it for you. Can you scan it? I can then pull the circuit files for you”.

Seemingly in mid-air, a red circle appears around a barcode away to McKenna’s right. The heads-up display in McKenna’s glasses maintains a fix on the code even though he moves his head. He leans across and uses the camera to scan the code, which is instantaneously transmitted back to Edinburgh where the circuit plans are uploaded from the database. Sanderson extracts the relevant section before speaking again to McKenna.

“Jim, I’m initiating the synchronization, you should have it in a few seconds.”

The 3G transceiver on Jim’s belt receives the information and immediately routes it to his smart glasses via Bluetooth. As Jim looks at the damaged circuitry, the heads-up display begins to superimpose the circuit diagram over the actual circuits, adjusting for size. He spends a few minutes comparing the damaged circuits with the schematic images. He calls for more backup.

“Dave, the problem is definitely in this sector of the step-down circuit,” McKenna points to a series of circuit boards, “is there a suggested workaround in the troubleshooting file?”

Within minutes the heads-up display starts guiding McKenna through a series of measures that isolates and bypasses the damaged circuits. Within 20 minutes, McKenna successfully reboots the system – power is restored.

Five years ago, very little of the above could have been done as efficiently and intuitively. Field service engineers needed substantial experience to tackle complex tasks – they also had to carry heavy, often ruggedized PCs and a whole series of manuals on CD-ROMs. Technical backup, where available, was a cellular voice call.

Liverpool Street, London, May 1, 2008, 2:32 pm

Joanne King, an equity analyst, is meeting a buy-side client. As they settle into the soft leather chairs of the meeting room, she slides a flexible plastic sheet across the table. The sheet is printed with electronic ink. The latest marketing pack was downloaded to her mobile terminal on the way over in the taxi. She taps the screen of her smartphone and the slide set appears on the sheet. As Joanne and her client discuss the vagaries of the stock market, they are able to use virtual tabs to flip between ‘pages’ within the pack. When the client requests more information on the balance sheet of one of companies they’re discussing, Joanne is able to pull down the necessary information, adding it to the slide set.

Partway through the discussion, Joanne hears a subtle tone in her ear indicating an urgent communication request from her personal digital assistant. She apologizes to the client before initiating the communication path. “Wildfire, what’s the problem?” she knows that Wildfire will only override her no-interrupt rule if an issue requires immediate attention.

“An air traffic control strike in Paris has disrupted all flights. Your 6 pm Brussels flight is showing a two-hour delay and may be canceled. The best alternative is to take the Eurostar train. Services leave at 16:30 and 18:30.”

After a moment’s thought, Joanne comes to a decision: “Book the 16:30, please.” Conscious of the topics still to cover in her meeting, she adds, “Can you also have a taxi waiting when I am through here?”

野火确认指令和下降into meeting mode. Joanne apologizes to the client and resumes her meeting. Meanwhile, Joanne’s software agent communicates with various travel services, canceling her flight reservation and booking the rail service.

Having learned from Joanne’s prior behavior, the agent books a First Class seat in a carriage toward the front of the train. The agent also communicates with a taxi firm – a car will be waiting when her meeting is completed. The agent is authorized to spend money within predefined limits. Simultaneously, the agent modifies Joanne’s expense report and calendar.

Joanne’s dinner date with friends in Brussels will be hard to keep given the change in travel plans. The agent negotiates with the diaries of her three dinner guests and the reservation computer at their chosen restaurant. A new reservation is agreed and four diaries are updated accordingly.

At the conclusion of her meeting, Joanne leaves the slide set contained in the pre-punched flexible display. Her client will be able to store it in standard folders and refer to it at leisure. Solar cells ensure that there is enough power to display the material without having to worry about battery charge.

她走向出租车,乔安妮的位置感知PDA recognizes she is in motion and, therefore, ready to communicate. “Joanne, you have 2 voice messages, 23 business e-mails and 12 personal e-mails. How would you like me to handle them?” Joanne chooses to listen and respond to a voicemail on the short taxi ride to Waterloo, deferring the e-mails for the train.

Once in her seat on the Eurostar train, Joanne unfolds a screen and keyboard that work alongside her 3G smartphone. Bluetooth provides the link between the smartphone, screen and keyboard. The Light Emitting Polymer screen is extremely lightweight and flexible, yet delivers high contrast and color resolution. Power consumption is low.

琼花一个小时回复电子邮件fore kicking off her shoes and taking out an e-book to settle down to listen to some music. She is particularly looking forward to a new album she bought on the way to the station. A song she was unfamiliar with came over the radio in the taxi – loving it, but not knowing what it was, Joanne recorded a quick burst. Vodafone, her service provider, was able to identify the music and offered to sell her the single or album. In anticipation of her long train ride, she chose the album. Leaning back in her seat, she lets the cool beats ease her to Brussels.

In 2003, one-on-one presentations were either made from a PC screen or delivered on regular paper. Meeting interruptions were either obtrusive or impossible, and changing travel reservations on the fly typically required several people – often with intervention by the traveler herself. Meanwhile, mobile e-mail was possible but only on large-screen PCs, compromised by size, weight and power consumption, or devices with screens and keyboards too small for anything other than limited responses.

Hyde Park – May 1, 2:18 pm

Mike Lee is on his way home from high school. He flips his skateboard down three steps and dives for cover in the bushes, the sound of gunfire ringing in his ears. Peering through the leaves, he holds a small flat panel console in front of him. He scans through 120 degrees, concentrating on the screen. The intense rhythms of electro-house are now the loudest sounds he hears, but there is also the distant rap of gunfire. On the screen, he sees the surrounding park, but in addition, the occasional outlandish figure appears, flitting between hiding places among the trees. “Josh! Where are you?” Mike demands in an urgent whisper.

“I’m by the lake dude. Surrounded. Can you get down here? I’m running out of ammo.”

Mike swings around, looking toward the lake through his device. He sees Josh’s position highlighted on the screen. He turns back, takes a deep breath and starts jabbing buttons on his device. Explosions and smoke fill the screen. Then running to the path, he jumps back on his skateboard and carves down the hill to the lake, pitching into the shrubbery next to his buddy Josh. They proceed to engage the advancing enemy in a frenzy of laser grenades, gunfire and whoops of delight.

After a few minutes, they both hear the words they have been waiting for, “Well done men, you have completed Level 12. Hit the download button to move on to the next level.”

Mobile gaming, even as recently as 2003, offered a relatively poor user experience. Simple Java games were the norm. Games now not only involve online buddies but they are also immersed into the surrounding environment, massively enhancing the experience.

3G has come a long way from its ignominious start. However, the real catalyst that has made it a life-changing technology has been the incredible range of diverse technologies that have emerged to support the growth in wireless voice and data applications.

Cast List:

3G smartphones – Nokia, Motorola

Bluetooth earbuds – Sound ID

Heads-up display – Microvision

Voice-driven push-to-talk – Sonim

Voice control – Advanced Recognition Technologies

Personal digital assistant – Wildfire

Electronic ink pad – E Ink, Philips Electronics

Music capture – Shazam Entertainment

Foldable Light Emitting Polymer Display – Technology from Cambridge Display Technology

Augmented reality game console – Nokia N-Gage 4

Intelligent mobile agents – Hewlett Packard

Geo-location technology – Openwave

Where are these companies in 2023?

My original cast of technology characters has seen mixed fortunes, some are still around but with different owners while others have disappeared altogether. Few are still going in their original business niche:

NokiaandMotorolaare brands that are still making mobile devices, but in different guises than in 2003.

I don’t know what became ofSound ID. There is an app called SoundID created by Sonar Works, but it is different and unrelated to the Sound ID identified in the article. But Bluetooth True Wireless earbuds are now ahuge market.

Microvisionis still in business but has shifted its focus to LiDAR in the automotive space.

Sonimis still in business and still making ruggedized devices, including push-to-talk devices for the safety and security sectors.

Advanced Recognition Technologieswas acquired by Scansoft in 2005.

Wildfirewas an innovative voice-controlled personal assistant that was acquired by the operator Orange in 2000. But Orange killed the service in 2005.

E-Inkstill exists, although Philips parted ways with it in 2005.

Shazamstill exists but was acquired by Apple in 2018. When it started in 2002, you had to dial a short number and hold your phone to the sound source. Users would then receive an SMS with the song title and artist.

Cambridge Display Technologyis still around. It was floated on Nasdaq in 2004 and acquired by Sumitomo Chemical in 2007.

Hewlett Packardis clearly still around. However, it doesn’t make intelligent software agents. But then again, neither does anyone else, at least not in the way portrayed in the article.

Openwaveno longer exists, although many of its businesses have been absorbed into other entities.

Artificial Intelligence: Irrational Exuberance is in Full Swing

As surely as autumn and winter follow summer, the current exuberance aroundAIis not going to last simply because the machines remain incapable of living up to the expectations that have been set for them.

These cycles typically take the form of a discovery of some description followed by a ramping of expectations which in turn leads to large amounts of money being invested for fear of missing out (FOMO).

The problem is that the expectations that are set are always unrealistic, meaning that when the time comes to deliver on those expectations, disappointment sets in. This is followed by collapsing valuations, bankruptcies and forced consolidation as investors are no longer willing to suspend disbelief.

This is the fourth AI Hype cycle with the others occurring in the 1960s, 1980s and 2017-2019, and this hype cycle looks exactly the same as the others except that it is much larger. Looking at investment activity and news flow, it is also very clear exactly where we are in the cycle.

First, expectations

  • The ability of Large Language Models (LLMs) to mimic human behavior has convinced some of the big names (like Professor Geoffrey Hinton) that artificial superintelligence is now materially closer than it was before.
  • While LLMs do have some very useful and lucrative use cases, they still have no causal understanding of the tasks they are performing.
  • This is why they hallucinate, make the most basic factual errors and are generally completely unreliable.
  • Therefore, the machines remain as stupid as ever. There is no evidence whatsoever that these machines are able to think.
  • But the problem is that they are so good at pretending to think that they are able to fool the great minds that created them.
  • Instead, all they do is calculate statistical relationships, meaning that the big promises that have been made will not be kept.

Second, investment

  • There are already many examples of money being thrown at start-ups with valuations and fundamentals being an afterthought:
  • OpenAI’s $30-billion valuation with a corporate culture that doesn’t want to make any profit.
  • Inflexion AI raising $1.3 billion fromMicrosoftandNVIDIAat an estimated valuation of around $5 billion despite having only been around for a year and having no commercial product.
  • Mistral AI raising $113 million at a $260-million pre-money valuation despite being only a few weeks old with no revenues, no product and probably only the vaguest idea of what it is going to do.
  • This can be described as the very definition of a bubble where rationality gets lost in the mad rush toward the next big thing. A lot of shirts are going to be lost.

The latest innovations around LLMs have produced some remarkable abilities which, no doubt, will be put to both good and lucrative use. However, the technology upon which they are based has not changed, meaning that the limitations that preventeddigitalassistants and autonomous driving from being useful for anything more than the most basic tasks are also going to trip LLMs up.

Furthermore, this is no longer the exclusive realm of the big, well-financed companies that can pay tens of millions of dollars for massive compute capacity, as the hobbyists and enthusiasts are now creating generative AI.MetaPlatforms’ series of LLMs called LlaMa are now freely available to anyone who wants to tinker and advances in training techniques have meant that it is possible to fine-tune a 7bn parameter model on a powerful laptop.

This is why there are models popping up all over the place that are completely free to use. Some of them actually work quite well. Hence, the pricing of $20 per month for services likeGPT-4, Perplexity AI and Midjourney may soon come under relentless pressure. This is really bad news for investors relying on spreadsheets for their return because no one seems to have modeled this scenario out.

The first sign of trouble will come when companies come back to the market after spending the money on fancy offices and expensive staff but nothing to show for the investments so far. This is when the down rounds begin, disillusionment sets in, reality makes its presence felt and winter begins.

有人怀疑这将开始在第一half of 2024 and the fallout will not be pretty.

(This is a version of a blog that first appeared on Radio Free Mobile. All views expressed are Richard’s own.)


相关的帖子

Colombia Telecom Meet Focuses on Digital Divide, 5G Launch

    • Colombian telecom operators and government are positive about the outcome of the 5G spectrum auction in December.
    • 政府、电信运营商和监管bodies want to increase broadband penetration and provide training to Colombians to bridge the digital divide.
    • Telecom operators ETB and Claro shared their plans to leverage 5G to enhance verticals like education, mobility, and B2B services.
    Even asColombialooks forward to holding its first 5G auction in December this year, it has much more basic and long-pending telecom sector-related issues that need the attention of all stakeholders, whether the operators or government departments. It is with this in mind that representatives from the Colombian government, regulatory bodies, telecom operators and manufacturers gathered in the nation’s capital, Bogota, at the end of June for the Conecta Colombia summit. The event, which is a part of the series that Conecta Latam organizes in the region, saw discussions on regulation, technology and, of course, business. A team of analysts from Counterpoint was also present at the event. Here are their key takeaways:
    5G and connectivity expansionThe potential of 5G technology and its impact on various sectors were essential discussion topics at the event. Telecom operators ETB and Claro shared their plans to leverage 5G to enhance verticals like education, mobility, and B2B services. Investments in broadband and 4G were also emphasized as ongoing priorities alongside the imminent 5G auction. The discussions also recognized the need to focus on network coverage and connectivity expansion, ensuring that before the widespread adoption of 5G.
    1 e1690873241305
    Main 5G verticals to be developed in Colombia, including a $166 billion opportunity until 2035 as per Hugo Chang at Nokia.
    Security and digital transformation
    Bridging the digital divideAccording to the regulators, by the end of Q1 2023, Colombia had 9 million fixed broadband connections, which means that only approx. 61% of households in Colombia have access to fixed broadband. Highlighting Colombia’s digital divide, speakers at the summit emphasized the need to address challenges Colombians have such as lack of technical skills and connectivity. The government, telecom operators and regulatory bodies discussed strategies to increase broadband penetration, provide training to Colombians, involve regional stakeholders in the sector decision-making, and improve job formalization. The focus was on leveraging technology to narrow the digital divide and ensure equal access to digital opportunities.
    그림2 2
    “Centros Digitales” at the core of the solution to bridge the digital divide. MinTic aims to reach 85% broadband penetration by 2031 compared to 61% in Q1 2023.
    5G and connectivity expansionThe potential of 5G technology and its impact on various sectors were essential discussion topics at the event. Telecom operators ETB and Claro shared their plans to leverage 5G to enhance verticals like education, mobility, and B2B services. Investments in broadband and 4G were also emphasized as ongoing priorities alongside the imminent 5G auction. The discussions also recognized the need to focus on network coverage and connectivity expansion, ensuring that before the widespread adoption of 5G.
    1 e1690873241305
    Main 5G verticals to be developed in Colombia, including a $166 billion opportunity until 2035 as per Hugo Chang at Nokia.
    Security and digital transformation
    During 2022 there were 20 billion failed cyberattacks in Colombia as disclosed by a Fortinet study, based on that government officials and telecom operators highlighted the importance of cybersecurity for the industry. Panel discussions emphasized the need for a national entity to coordinate carrier efforts and use artificial intelligence (AI) as a defensive tool. Besides, the event explored the concept of digital transformation withintelecomcompanies, with discussions revolving around employee training, customer experience and the transformation of big data into “mega data” facilitated by 5G. The role of alternative revenue drivers such as VoLTE and MVNO products were also highlighted as options before the 5G benefits come to play.
    4 1 e1690873361828
    Saul Kattan, technical consultant of the Colombian presidency leads the panel discussion on the role of a national cybersecurity entity sponsored by the government.
    What is next?On August 1, the regulator will publish the first draft of the auction process, covering the mechanism to be followed and the required technical conditions for the applicants. The second draft will be released one month later and the final in October. In November and December, all the interested parties will submit their applications for the auction, which will start on December 20. 2024 can become the year of the 5G availability in the country.
    5 e1690873422725
    5G auction and 4G spectrum renovations chronogram.
    Bridging the digital divideAccording to the regulators, by the end of Q1 2023, Colombia had 9 million fixed broadband connections, which means that only approx. 61% of households in Colombia have access to fixed broadband. Highlighting Colombia’s digital divide, speakers at the summit emphasized the need to address challenges Colombians have such as lack of technical skills and connectivity. The government, telecom operators and regulatory bodies discussed strategies to increase broadband penetration, provide training to Colombians, involve regional stakeholders in the sector decision-making, and improve job formalization. The focus was on leveraging technology to narrow the digital divide and ensure equal access to digital opportunities.
    그림2 2
    “Centros Digitales” at the core of the solution to bridge the digital divide. MinTic aims to reach 85% broadband penetration by 2031 compared to 61% in Q1 2023.
    5G and connectivity expansionThe potential of 5G technology and its impact on various sectors were essential discussion topics at the event. Telecom operators ETB and Claro shared their plans to leverage 5G to enhance verticals like education, mobility, and B2B services. Investments in broadband and 4G were also emphasized as ongoing priorities alongside the imminent 5G auction. The discussions also recognized the need to focus on network coverage and connectivity expansion, ensuring that before the widespread adoption of 5G.
    1 e1690873241305
    Main 5G verticals to be developed in Colombia, including a $166 billion opportunity until 2035 as per Hugo Chang at Nokia.
    Security and digital transformation
    During 2022 there were 20 billion failed cyberattacks in Colombia as disclosed by a Fortinet study, based on that government officials and telecom operators highlighted the importance of cybersecurity for the industry. Panel discussions emphasized the need for a national entity to coordinate carrier efforts and use artificial intelligence (AI) as a defensive tool. Besides, the event explored the concept of digital transformation withintelecomcompanies, with discussions revolving around employee training, customer experience and the transformation of big data into “mega data” facilitated by 5G. The role of alternative revenue drivers such as VoLTE and MVNO products were also highlighted as options before the 5G benefits come to play.
    4 1 e1690873361828
    Saul Kattan, technical consultant of the Colombian presidency leads the panel discussion on the role of a national cybersecurity entity sponsored by the government.
    What is next?On August 1, the regulator will publish the first draft of the auction process, covering the mechanism to be followed and the required technical conditions for the applicants. The second draft will be released one month later and the final in October. In November and December, all the interested parties will submit their applications for the auction, which will start on December 20. 2024 can become the year of the 5G availability in the country.
    5 e1690873422725
    5G auction and 4G spectrum renovations chronogram.

    AI Needs to Reside in the Vehicle to Work Well

    Mercedes is running a beta program where those that opt in will be able to accessChatGPTfrom their vehicle by interacting with thevoice assistantalready present in MBUX-equipped vehicles. But rather than the cloud-based service that Mercedes is going with today, it should be looking at implementing ChatGPT directly in the vehicle.

    • Mercedes owners in theUScan enroll for the program by accepting an update for their car.
    • The test is due to run for three months and is being supported by Microsoft’s Azure OpenAI Service, which is an API to which clients can connect their services to havegenerative AIfunctionality.
    • Mercedes is able to implement this service very easily because all it is really doing is providing a prompt for the vehicle assistant to fill in, send it to the cloud and then read out the results.
    • This means that all of the inference or processing of the request will be done in the cloud with the voice assistant doing nothing more than acting as a front end to provide the voice functionality.
    • The vehicle is a use case where generative AI could have a disproportionately large impact. This is because a touch-based icon grid is a substandard user experience no matter who provides it.
    • The problem that the car makers have is that their icon grid is much worse than Apple, Gooxgle orTesla.
    • Furthermore, in 2016 and 2017 we concluded that voice was the leading contender to improve the digital experience in the vehicle but that voice was not good enough to create an acceptable user experience.
    • This is why vehicles are still limping along with smartphones embedded in the dashboard.
    • We have also concluded that generative AI represents a significant step forward in the ability of machines to communicate with humans and provide a user interface for a digital service.
    • Consequently, generative AI offers a significant opportunity for vehicle makers to win back the digital initiative that they have ceded to the digital ecosystems.
    • This is extremely important as vehicle makers’ ability to monetize the market for in-vehicle digital service will be contingent on their ability to remain relevant in the digital vehicle experience.
    • This is why Apple andGoogleare coming aggressively after the vehicle and so far, the OEMs have mounted feeble resistance or offered complete capitulation.
    • The problem with this approach is that the only way to implement generative AI effectively in the vehicle is to put it directly in the vehicle.
    • This is because reliability and speed are critical, and in this example when the network goes, the service goes with it.
    • Furthermore, it is unlikely that there will be any real integration with the vehicle, meaning that telling ChatGPT that one is feeling hot is likely to result in silence rather than the air-conditioner being turned up.
    • Using ChatGPT as the benchmark implementation in the vehicle will have a profound impact on the cost of the vehicle’s electronics as well as its power consumption which in anEVis a deal breaker.
    • There are rapid developments going on in the open-source community that may make this a lot easier to achieve, but implementing large language models outside of the data center remains a work in progress.
    • Despite the current limitations, the potential for generative AI to help OEMs to overcome their digital shortcomings is substantial and represents one of the best opportunities the OEMs have had for a long time.
    • The risk is that if no one uses it as a result of the way the Mercedes experiment is implemented, it will lead to the (wrong) conclusion that putting it in the vehicle is a waste of time.
    • This would lead to the squandering of another opportunity, resulting in digital irrelevance and greater commoditization.
    • We remain pretty pessimistic about the outlook for the OEMs.

    (This is a version of a blog that first appeared on Radio Free Mobile. All views expressed are Richard’s own.)

    相关的帖子

    Podcast #66: HERE UniMap – Leveraging AI and Automation to Redefine Mapmaking at Scale

    AtCES 2023,HERETechnologies unveiled UniMap, an automated mapping technology that allows customers to create their own personal maps and customize location services. It uses AI models to automate the processing of vehicle probes and sensor data to validate speed limits, extract map features like 2D and 3D positioning of road signs and much more.

    With more vehicles gaining autonomous features, UniMap also aligns standard definition (SD), high definition (HD), and Advanced-Driver-Assistance System (ADAS) to offer seamless access to navigation data. Customers also get quick access to unified map data that is refreshed in hours, minutes, and even seconds. The data is easily accessible at any time of the day because it is all saved in a single environment. In this podcast, many other unique features of the UniMap platform are discussed in detail.

    In the latest episode of ‘The Counterpoint Podcast’, hostPeter Richardsonis joined by Vice President of Product and Technology atHERE Technologies,Remco Timmer, andCounterpoint’s Research AnalystMohit Sharmato talk about UniMap. The podcast weaves through various topics like the use cases of UniMap, the industry sectors that can benefit from this new mapping solution, and much more.

    Click to listen to the podcast

    You can read the podcast transcripthere.

    Podcast Chapter Markers

    01:18:Remco Timmer gives a brief introduction of himself and his role at HERE Technologies.

    02:22:Mohit Sharma talks about Counterpoint’s location platform research, location intelligence, and more.

    06:20:Between standard definition and high-definition maps, where does UniMap fit in the picture? Timmer explains.

    09:22:With terabytes of data, how does the system pick up important vs irrelevant one? Timmer explains further.

    11:45:Timmer on how UniMap can be beneficial to the transport and logistics sector.

    13:65:Timmer talks about some interesting use cases for UniMap.

    15:10:Sharma talks about mapping solutions from a broad industry view.

    18:00:Remco Timmer on when we can expect to see the first full integration of UniMap.

    19:20:Final comments from Sharma on the overall mapping industry.

    Also available for listening/download on:

    apple podcasts logo spotify tune in radio google podcasts logo

    相关的帖子

    COMPUTEX 2023: AI Solutions, Capabilities in Focus

    科技巨头展示了他们最先进的解决方案s in AI and computing at the COMPUTEX 2023 show in Taipei in the first week of June. If NVIDIA CEO Jensen Huang’s keynote address focused on the company’s game-changing innovations around AI, Arm CEO Rene Haas’ keynote had compelling demonstrations showcasing Arm’s capabilities in AI. Qualcomm focused on on-device intelligence enhancement and Hybrid AI as the mainstream format ofAIin the future. With meaningful upgrades in computing capability, we expect to see the beginning of a new chapter in the coming years. In the following sections, we summarize the key takeaways from COMPUTEX 2023.

    NVIDIA: Grace Hopper Superchip to boost AI revolutionS 10715151 e1687763755353

    The world’s first-of-its-kind Grace Hopper Superchip, which is manufactured by the TSMC 4nm process node, is likely to level up NVIDIA’s determination on AI. In addition, more GPUs will be used for generative AI training and inference models, which willacceleratethe transformative technology in the near term, highlighted Mr. Huang.

    Our Associate Director,Brady Wang,shared his ideas and insights at the influential event.

    S 10715143

    NVIDIA also introduced the Spectrum-X platform, a fusion of the Spectrum-4 switch and Bluefield-3 DPUs, which boasts a record 51Tb/sec Ethernet speed and is tailor-made forAI networks. Combined with BlueField-3 DPUs and NVIDIA LinkX optics, it forms an end-to-end 400GbE network optimized for AI clouds. This innovation not only fits NVIDIA’s target but also consumes a great amount of foundry capacity, especially TSMC.

    NVIDIA ACE Framework

    Counterpoint Research - Nvidia ACE_Dark
    Source: Nvidia

    NVIDIA: Avatar Cloud Engine (ACE) for Games

    NVIDIA did not forget its loyal gamers. This time, it introduced the Avatar Cloud Engine (ACE) for Games, a groundbreaking custom AI model service. ACE empowers non-playable characters (NPCs) in games with AI-driven natural language interactions, revolutionizing the gaming experience. With ACE, gamers can enjoy more immersive and intelligent gameplay.

    Notes from analyst Q&A with NVIDIA founder and CEO Jensen Huang

    • If most of the workload involves training AI models, the data center operates as an AI factory. An optimal computer is capable of handling both training and inference tasks, although the selection of processors is contingent upon the specific inference type.Picture3
    • In the foreseeable future, AI is poised to become the predominant force within the realm of NPCs in video games. These NPCs will possess a distinctive narrative and contextual background, effortlessly engaging with one another in a harmonious manner. Their movements will be fluid and their comprehension of instructions will be exceptional.
    • AI revolutionizes the user experience onPCs, propelling the advancement of personalized recommender systems. Furthermore, even on compact smartphones, it taps into extensive personalized internet data. As a result, future interactions generate real-time, customized content, transitioning towards generative processing to accommodate the escalating demand for personalized information. This groundbreaking development signifies the dawning of a new era characterized by the proliferation of generated and augmented information, thereby departing from the previously dominant retrieval-centric paradigm.
    • InfiniBand excels in high-performance computing, offering superior throughput for single computers and AI factories. It dominates in supercomputers and AI systems, while Ethernet is prevalent in cloud environments.
    • China leads the way in cloud services, consumer internet and digital payments. It has swiftly advanced in electric and autonomous vehicles, showcasing local innovation through numerous GPU start-ups. This underscores China’s technological dominance and promising future growth.
    • Omniverse streamlines computer setup with cloud integration and partnerships, enabling effortless information streaming through browser-based access. It optimizes factory design and simulation, minimizing work, errors and expenses.

    Arm: Everything now is a computer; AI runs on Arm

    Citing the remarkable 260% increase in data center workloads from 2015 to 2021,ArmCEO Rene Haas emphasized the pivotal role of data centers, automotive technology and AI in driving the compute demand powered by ARM designs.

    According to Counterpoint Research,Arm-based notebookswill gain over Intel and AMD, almost doubling their shipment share to 25% by 2027 from 14% today.

    Laptop Shipment Share by CPU/SoC Type %

    Counterpoint Research - laptop shipment by CPU type

    source: Counterpoint Research

    AI emerged as a focal point during Haas’ keynote, where he captivated the audience with compelling demonstrations showcasing Arm’s capabilities. That said, Arm is poised to support an even broader range of applications in the future.

    We also echo Arm’s view and believe that generative AI, digital twins and edge computing will emerge as top technology trends in 2023 and affect the whole tech industry.

    Qualcomm: Focus on on-device intelligence enhancement, Hybrid AI

    Qualcomm’s “AI” Hexagon processor offers 3-5 times better computing performance compared to existing CPU/GPU solutions. The company aims to expand the application of its Snapdragon 8cx Gen 3 processor to the laptop industry with thorough support from Microsoft.

    For AI, Qualcomm believes there are limitations to the efficiency improvements of Edge AI or Cloud AI. However, leveraging Qualcomm’s connectivity solutions and combining edge AI with cloud AI can provide incremental benefits in terms of cost/energy savings, privacy and security enhancements, reliability, and latency. Therefore, the company guides that Hybrid AI (Edge AI + Cloud AI) will be the mainstream format of AI in the future.

    Conclusion

    With the COVID-19 pandemic loosening its grip,COMPUTEXwas back in its on-site mode this year in Taipei with solid and eye-catching AI solutions. We are stepping into a newcomputing与生成人工智能时代,将改变我们的生活. Not only server vendors and data center hyper-scalers, but mobile and PC vendors are also working together to facilitate technology improvements with AI support.

    Counterpoint’s analysts continue to work closely with the tech product market to monitor all changes and trends.

    相关的帖子

    Term of Use and Privacy Policy

    Counterpoint Technology Market Research Limited

    Registration

    In order to access Counterpoint Technology Market Research Limited (Company or We hereafter) Web sites, you may be asked to complete a registration form. You are required to provide contact information which is used to enhance the user experience and determine whether you are a paid subscriber or not.
    当你注册个人信息我们问you for personal information. We use this information to provide you with the best advice and highest-quality service as well as with offers that we think are relevant to you. We may also contact you regarding a Web site problem or other customer service-related issues. We do not sell, share or rent personal information about you collected on Company Web sites.

    How to unsubscribe and Termination

    You may request to terminate your account or unsubscribe to any email subscriptions or mailing lists at any time. In accessing and using this Website, User agrees to comply with all applicable laws and agrees not to take any action that would compromise the security or viability of this Website. The Company may terminate User’s access to this Website at any time for any reason. The terms hereunder regarding Accuracy of Information and Third Party Rights shall survive termination.

    Website Content and Copyright

    This Website is the property of Counterpoint and is protected by international copyright law and conventions. We grant users the right to access and use the Website, so long as such use is for internal information purposes, and User does not alter, copy, disseminate, redistribute or republish any content or feature of this Website. User acknowledges that access to and use of this Website is subject to these TERMS OF USE and any expanded access or use must be approved in writing by the Company.
    – Passwords are for user’s individual use
    – Passwords may not be shared with others
    – Users may not store documents in shared folders.
    – Users may not redistribute documents to non-users unless otherwise stated in their contract terms.

    Changes or Updates to the Website

    The Company reserves the right to change, update or discontinue any aspect of this Website at any time without notice. Your continued use of the Website after any such change constitutes your agreement to these TERMS OF USE, as modified.
    Accuracy of Information: While the information contained on this Website has been obtained from sources believed to be reliable, We disclaims all warranties as to the accuracy, completeness or adequacy of such information. User assumes sole responsibility for the use it makes of this Website to achieve his/her intended results.

    Third Party Links: This Website may contain links to other third party websites, which are provided as additional resources for the convenience of Users. We do not endorse, sponsor or accept any responsibility for these third party websites, User agrees to direct any concerns relating to these third party websites to the relevant website administrator.

    Cookies and Tracking

    We may monitor how you use our Web sites. It is used solely for purposes of enabling us to provide you with a personalized Web site experience.
    This data may also be used in the aggregate, to identify appropriate product offerings and subscription plans.
    Cookies may be set in order to identify you and determine your access privileges. Cookies are simply identifiers. You have the ability to delete cookie files from your hard disk drive.
    This site is registered onwpml.orgas a development site.