Machine Learning
MLCommons, the consortium behind the MLPerf family of machine learning benchmarks, is announcing this morning that the organization will be developing a new desktop AI benchmarking suite under the MLPerf banner. Helmed by the body’s newly-formed MLPerf Client working group, the task force will be developing a client AI benchmark suit aimed at traditional desktop PCs, workstations, and laptops. According to the consortium, the first iteration of the MLPerf Client benchmark suite will be based on Meta’s Llama 2 LLM, with an initial focus on assembling a benchmark suite for Windows. The de facto industry standard benchmark for AI inference and training on servers and HPC systems, MLCommons has slowly been extending the MLPerf family of benchmarks to additional devices over the past several years...
AMD: Partial RDNA 3 Video Card Support Coming to Future ROCm Releases
AMD this morning is formally announcing the launch of the latest version of its GPU compute software stack, ROCm 5.7. Along with making several important updates to the software...
15 by Ryan Smith on 6/29/2023Intel Discloses New Details On Meteor Lake VPU Block, Lays Out Vision For Client AI
While the first systems based on Intel’s forthcoming Meteor Lake (14th Gen Core) systems are still at least a few months out – and thus just a bit too...
17 by Ryan Smith on 5/29/2023NVIDIA: Grace Hopper Has Entered Full Production & Announcing DGX GH200 AI Supercomputer
Teeing off an AI-heavy slate of announcements for NVIDIA, the company has confirmed that their Grace Hopper “superchip” has entered full production. The combination of a Grace CPU and...
8 by Ryan Smith on 5/29/2023NVIDIA Announces H100 NVL - Max Memory Server Card for Large Language Models
While this year’s Spring GTC event doesn’t feature any new GPUs or GPU architectures from NVIDIA, the company is still in the process of rolling out new products based...
25 by Ryan Smith on 3/21/2023NVIDIA Hopper GPU Architecture and H100 Accelerator Announced: Working Smarter and Harder
Depending on your point of view, the last two years have either gone by very slowly, or very quickly. While the COVID pandemic never seemed to end – and...
88 by Ryan Smith on 3/22/2022Cerebras Completes Series F Funding, Another $250M for $4B Valuation
Every once in a while, a startup comes along with something out of left field. In the AI hardware generation, Cerebras holds that title, with their Wafer Scale Engine...
25 by Dr. Ian Cutress on 11/10/2021NVIDIA Launches A2 Accelerator: Entry-Level Ampere For Edge Inference
Alongside a slew of software-related announcements this morning from NVIDIA as part of their fall GTC, the company has also quietly announced a new server GPU product for the...
16 by Ryan Smith on 11/9/2021Hot Chips 2021 Live Blog: Machine Learning (Graphcore, Cerebras, SambaNova, Anton)
Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...
3 by Dr. Ian Cutress on 8/24/2021Hot Chips 2021 Live Blog: Machine Learning (Esperanto, Enflame, Qualcomm)
Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...
0 by Dr. Ian Cutress on 8/24/2021Hot Chips 2021 Keynote Live Blog: Designing Chips with AI, Synopsys
Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...
0 by Dr. Ian Cutress on 8/23/2021Cadence Cerebrus to Enable Chip Design with ML: PPA Optimization in Hours, not Months
The design of most leading edge processors and ASICs rely on steps of optimization, with the three key optimization points being Performance, Power, and Area (and sometimes Cost). Once...
20 by Dr. Ian Cutress on 7/22/2021Using AI to Build Better Processors: Google Was Just the Start, Says Synopsys
In light of the rate of innovation, chip design teams have spent tens of thousands of hours honing their skills over the decades. But getting the best human-designed processor...
100 by Dr. Ian Cutress on 6/23/2021Xilinx Expands Versal AI to the Edge: Helping Solve the Silicon Shortage
Today Xilinx is announcing an expansion to its Versal family, focused specifically on low power and edge devices. Xilinx Versal is the productization of a combination of many different...
25 by Dr. Ian Cutress on 6/9/2021MLPerf Inference v1.0: 2000 Suite Results, New Power Measurements
There has been a strong desire for a series of industry standard machine learning benchmarks, akin to the SPEC benchmarks for CPUs, in order to compare relative solutions. Over...
11 by Dr. Ian Cutress on 4/21/2021Graphcore Series E Funding: $710m Total, $440m Cash-in-Hand
For those that aren’t following the AI industry, one of the key metrics to observe for a number of these AI semiconductor startups is the amount of funding they...
12 by Dr. Ian Cutress on 1/4/2021Qualcomm's Cloud AI 100 Now Sampling: Up to 400TOPs at 75W
Today Qualcomm is revealing more information on last year’s announced “Cloud AI 100” inference chip and platform. The new inference platform by the company is said to have entered...
15 by Andrei Frumusanu on 9/16/2020342 Transistors for Every Person In the World: Cerebras 2nd Gen Wafer Scale Engine Teased
One of the highlights of Hot Chips from 2019 was the startup Cerebras showcasing its product – a large ‘wafer-scale’ AI chip that was literally the size of a...
32 by Dr. Ian Cutress on 8/18/2020Arm Announces Ethos-N78 NPU: Bigger And More Efficient
Yesterday Arm released the new Cortex-A78, Cortex-X1 CPUs and the new Mali-G78 GPU. Alongside the new “key” IPs from the company, we also saw the reveal of the newest...
34 by Andrei Frumusanu on 5/27/2020AMD Unveils CDNA GPU Architecture: A Dedicated GPU Architecture for Data Centers
Over the last decade, the industry has seen a boom in demand for GPUs for the data center. Driven in large part by rapid progress in neural networking, deep...
26 by Ryan Smith on 3/5/2020