Information Technology

12.04.2021

Machine learning at speed

Technology developed through a KAUST-led collaboration with Intel, Microsoft and the University of Washington can dramatically increase the speed of machine learning on parallelized computing systems.
Credit: © 2021 KAUST; Anastasia Serin

Inserting lightweight optimization code in high-speed network devices has enabled a KAUST-led collaboration to increase the speed of machine learning on parallelized computing systems by up to 5.5 times.

This “in-network aggregation” technology, developed with researchers and systems architects at Intel, Microsoft and the University of Washington, can provide dramatic speed improvements using readily available programmable network hardware.

What gives artificial intelligence (AI) so much power to “understand” and interact with the world is the machine-learning step, in which the model is trained using large sets of labeled training data. The more data the AI is trained on, the better the model is likely to perform when exposed to new inputs.

The recent burst of AI applications is largely due to better machine learning and the use of larger models and more diverse datasets. Performing the machine-learning computations, however, is an enormously taxing task that increasingly relies on large arrays of computers running the learning algorithm in parallel.

“How to train deep-learning models at a large scale is a very challenging problem,” says Marco Canini from the KAUST research team. “The AI models can consist of billions of parameters, and we can use hundreds of processors that need to work efficiently in parallel. In such systems, communication among processors during incremental model updates easily becomes a major performance bottleneck.”

The team found a potential solution in new network technology developed by Barefoot Networks, a division of Intel.

“We use Barefoot Networks’ new programmable dataplane networking hardware to offload part of the work performed during distributed machine-learning training,” explains Amedeo Sapio, a KAUST alumnus who has since joined the Barefoot Networks team at Intel. “Using this new programmable networking hardware, rather than just the network, to move data means that we can perform computations along the network paths.”

The key innovation of the team’s SwitchML platform is to allow the network hardware to perform the data aggregation task at each synchronization step during the model update phase of the machine-learning process. Not only does this offload part of the computational load, it also significantly reduces the amount of data transmission.
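The idea of aggregating in the network can be illustrated with a toy model. The sketch below is not SwitchML's code: the function names, the use of plain Python lists as model updates, and the naive all-to-all baseline are assumptions chosen only to show why summing updates at a switch reduces the number of values that cross the network.

```python
from typing import List, Tuple

Update = List[int]  # one worker's model update (hypothetical representation)


def aggregate_at_workers(updates: List[Update]) -> Tuple[Update, int]:
    """Naive baseline (an assumption, not the article's state of the art):
    every worker sends its full update to every other worker, and each
    worker sums the updates locally."""
    n, size = len(updates), len(updates[0])
    values_sent = n * (n - 1) * size  # all-to-all exchange
    total = [sum(column) for column in zip(*updates)]
    return total, values_sent


def aggregate_in_network(updates: List[Update]) -> Tuple[Update, int]:
    """In-network aggregation: each worker sends its update once to the
    switch, the switch sums them, and the sum is sent back to each worker."""
    n, size = len(updates), len(updates[0])
    total = [sum(column) for column in zip(*updates)]  # done "on the switch"
    values_sent = 2 * n * size  # n uploads plus n downloads
    return total, values_sent


if __name__ == "__main__":
    worker_updates = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]
    print(aggregate_at_workers(worker_updates))
    print(aggregate_in_network(worker_updates))
```

Both functions produce the same aggregated update; only the traffic count differs. In this toy model the in-network version sends fewer values once there are more than three workers, and the gap widens as workers are added.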

“Although the programmable switch dataplane can do operations very quickly, the operations it can do are limited,” says Canini. “So our solution had to be simple enough for the hardware and yet flexible enough to solve challenges such as limited onboard memory capacity. SwitchML addresses this challenge by co-designing the communication network and the distributed training algorithm, achieving an acceleration of up to 5.5 times compared to the state-of-the-art approach.”
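One way to picture the constraints Canini describes is a switch that holds only a small, fixed block of memory and can do nothing more than add integers. The sketch below works under those assumptions; `slot_size`, `scale`, and both function names are hypothetical, and the code is an illustration rather than the team's design. Workers' updates are streamed through the switch one chunk at a time, so its memory use stays constant however large the model is.

```python
from typing import List


def to_fixed(value: float, scale: int) -> int:
    """Convert a float to a fixed-point integer (assumes integer-only switch)."""
    return int(round(value * scale))


def switch_aggregate_streaming(
    updates: List[List[float]], slot_size: int = 4, scale: int = 1 << 16
) -> List[float]:
    """Sum the workers' updates chunk by chunk using one small integer slot."""
    n_values = len(updates[0])
    result: List[float] = []
    for start in range(0, n_values, slot_size):
        slot = [0] * slot_size  # the only state the simulated switch keeps
        for worker_update in updates:
            chunk = worker_update[start:start + slot_size]
            for i, value in enumerate(chunk):
                slot[i] += to_fixed(value, scale)  # integer addition only
        width = min(slot_size, n_values - start)
        # Workers convert the aggregated integers back to floats.
        result.extend(s / scale for s in slot[:width])
    return result


if __name__ == "__main__":
    updates = [[0.5, 1.0, -2.0, 0.25, 3.0], [1.5, 2.0, 1.0, 0.75, -1.0]]
    print(switch_aggregate_streaming(updates))
```

The trade-off in this sketch is that the workers take on the conversion to and from integers and must send chunks in step with one another, which keeps the work left to the switch simple.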

Media Contact

Michael Cusack

King Abdullah University of Science & Technology (KAUST)

EurekAlert!
