Inteligencia artificial LLM ... | Apuntes de Inteligencia Artificial

AI and ML Accelerator Survey and Trends

Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, and Jeremy Kepner

MIT Lincoln Laboratory Supercomputing Center

Lexington, MA, USA

{reuther,pmichaleas,michael.jones,vijayg,sid,kepner}@ll.mit.edu

Abstract—This paper updates the survey of AI accelerators

and processors from past three years. This paper collects and

summarizes the current commercial accelerators that have been

publicly announced with peak performance and power consump-

tion numbers. The performance and power values are plotted on

a scatter graph, and a number of dimensions and observations

from the trends on this plot are again discussed and analyzed.

Two new trends plots based on accelerator release dates are

included in this year’s paper, along with the additional trends

of some neuromorphic, photonic, and memristor-based inference

accelerators.

Index Terms—Machine learning, GPU, TPU, dataflow, accel-

erator, embedded inference, computational performance

I. INTRODUCTION

Just as last year, the pace of new announcements, releases,

and deployments of artificial intelligence (AI) and machine

learning (ML) accelerators from startups and established tech-

nology companies has been modest. This is not unreason-

able; for many companies that have released an accelerator

report having spent three or four years researching, analyzing,

designing, verifying, and validating their accelerator design

trade-offs and building the software stack to program the

accelerator. For those who have released subsequent versions

of their accelerator, they have reported shorter development

cycles, though it is still at least two or three years. The focus of

these accelerators continues to be on accelerating deep neural

network (DNN) models, and the application space spans from

very low power embedded voice recognition and image clas-

sification to data center scale training, while the competition

for defining markets and application areas continues as part

of a much larger industrial and technology shift in modern

computing to machine learning solutions.

AI ecosystems bring together components from embed-

ded computing (edge computing), traditional high perfor-

mance computing (HPC), and high performance data analy-

sis (HPDA) that must work together to effectively provide

capabilities for use by decision makers, warfighters, and

analysts [1]. Figure 1 captures an architectural overview of

such end-to-end AI solutions and their components. On the

left side of Figure 1, structured and unstructured data sources

provide different views of entities and/or phenomenology.

This material is based upon work supported by the Assistant Secretary

of Defense for Research and Engineering under Air Force Contract No.

FA8702-15-D-0001. Any opinions, findings, conclusions or recommendations

expressed in this material are those of the author(s) and do not necessarily

reflect the views of the Assistant Secretary of Defense for Research and

Engineering.

Fig. 1: Canonical AI architecture consists of sensors, data con-

ditioning, algorithms, modern computing, robust AI, human-

machine teaming, and users (missions). Each step is critical

in developing end-to-end AI applications and systems.

These raw data products are fed into a data conditioning step

in which they are fused, aggregated, structured, accumulated,

and converted into information. The information generated by

the data conditioning step feeds into a host of supervised

and unsupervised algorithms such as neural networks, which

extract patterns, predict new events, fill in missing data, or

look for similarities across datasets, thereby converting the

input information to actionable knowledge. This actionable

knowledge is then passed to human beings for decision-

making processes in the human-machine teaming phase. The

phase of human-machine teaming provides the users with

useful and relevant insight turning knowledge into actionable

intelligence or insight.

Underpinning this system are modern computing systems.

Moore’s law trends have ended [2], as have a number of related

laws and trends including Denard’s scaling (power density),

clock frequency, core counts, instructions per clock cycle,

and instructions per Joule (Koomey’s law) [3]. Taking a page

from the system-on-chip (SoC) trends first seen in automotive

applications, robotics, and smartphones, advancements and

innovations are still progressing by developing and integrating

accelerators for often-used operational kernels, methods, or

functions. These accelerators are designed with a different

balance between performance and functional flexibility. This

includes an explosion of innovation in deep machine learning

processors and accelerators [4]–[8]. In this series of survey

papers, we explore the relative benefits of these technologies

since they are of particular importance to applying AI to

domains under significant constraints such as size, weight, and

arXiv:2210.04055v1 [cs.AR] 8 Oct 2022

Inteligencia artificial LLM ..., Apuntes de Inteligencia Artificial

Documentos relacionados

Vista previa parcial del texto

¡Descarga Inteligencia artificial LLM ... y más Apuntes en PDF de Inteligencia Artificial solo en Docsity!