[Distinguished Lecture Series] RNGD: Design and Implementation of PyTorch-Compatible Programming Interfaces for Tensor Contraction Processors ⋅ 세미나 ⋅ 서울대학교 컴퓨터공학부

세미나

[Distinguished Lecture Series] RNGD: Design and Implementation of PyTorch-Compatible Programming Interfaces for Tensor Contraction Processors

Capsule Introduction.pdf(2.03MiB)

이름: 백준호 대표이사, 김솔 Technical Staff

소속: Furiosa AI

주최: 허충길 교수

날짜: 2025/12/04 오전 11:00 - 오후 12:00

위치: 302동 105호

요약

The future of AI is in its sustainability. As more complex and heavier AI models are being applied to diverse industries, the demand for energy increases exponentially to an unprecedented numbers. Conventional designs may deliver more power yet struggles to deliver sustainable scaling, and hence the need for a new powerful yet efficient solution for today and tomorrow's models.

RNGD (pronounced "Renegade"), FuriosaAI's second-generation tensor contraction processor, is built with the purpose of AI inference at its core. Through designing and developing a completely new architecture along with a complete software stack specifically for AI workloads, RNGD provides performance without the sacrifising efficiency. Dynamically configurable coarse-grained processing elements (PEs) can act as a single large compute unit or as multiple independent units, adapting to a wide range of tensor shapes and sizes. Through such architecture, RNGD aims and delivers flexible, powerful, and efficient computing of AI inference.This flexibility ensures efficient utilization across varying inference workloads.

In this session, we present the design and implementation of hardware-backed, open-source–compatible programming models and interfaces. The Tensor Contraction Processor (TCP) is a hardware–software co-designed architecture that introduces a novel approach to accelerating deep neural networks. Since tensor contraction lies at the core of machine learning workloads, TCP is built to efficiently compute multidimensional tensor contractions through programmable hardware.
To make this hardware accessible and practical for end users, we must go beyond providing a compiler and runtime system, a rich set of programming interfaces is essential. In this talk, we share our experiences, insights, and lessons learned from the trials and errors of designing these interfaces, and discuss what it takes to integrate such specialized hardware into widely used software ecosystems.

* 참석자에게 점심을 제공합니다.

연사 소개

June Paik is Founder and CEO of FuriosaAI. Before starting the journey of FuriosaAI in 2017, he had accumulated deep experience in developing world-class semiconductor products—from hardware to software—at major chip companies such as AMD and Samsung Electronics. As Founder and CEO of FuriosaAI, he is setting the product vision and orchestrating the engineering development of the world’s best AI chips. He received his master’s degree in electrical engineering from Georgia Tech.

Sol Kim is a Member of the Technical Staff at FuriosaAI and a seasonsed developer with 10+ years of software engineering, of which 6+ years of software engineering were in FuriosaAI. He designs and develops the FuriosaAI HAL (hardware abstraction layer & language) and runtime, a runtime which through software out-of-order achieves 99%+ of NPU utilization on LLM serving.

MPC algorithms for geometric proximity problems

Asymptotically Optimal Hardness for k-Set Packing and k-Matroid Intersection

세미나

[Distinguished Lecture Series] RNGD: Design and Implementation of PyTorch-Compatible Programming Interfaces for Tensor Contraction Processors

소식