IEICE globals.ieice.org Site

Author Search Result

[Author] Tae Jun HAM(2hit)

1-2hit

Layerweaver+: A QoS-Aware Layer-Wise DNN Scheduler for Multi-Tenant Neural Processing Units
Young H. OH Yunho JIN Tae Jun HAM Jae W. LEE

LETTER-Fundamentals of Information Systems

Pubricized:
2021/11/11
Vol:
E105-D No:2
Page(s):
427-431
Many cloud service providers employ specialized hardware accelerators, called neural processing units (NPUs), to accelerate deep neural networks (DNNs). An NPU scheduler is responsible for scheduling incoming user requests and required to satisfy the two, often conflicting, optimization goals: maximizing system throughput and satisfying quality-of-service (QoS) constraints (e.g., deadlines) of individual requests. We propose Layerweaver+, a low-cost layer-wise DNN scheduler for NPUs, which provides both high system throughput and minimal QoS violations. For a serving scenario based on the industry-standard MLPerf inference benchmark, Layerweaver+ significantly improves the system throughput by up to 266.7% over the baseline scheduler serving one DNN at a time.
Eager Memory Management for In-Memory Data Analytics
Hakbeom JANG Jonghyun BAE Tae Jun HAM Jae W. LEE

LETTER-Computer System

Pubricized:
2018/12/11
Vol:
E102-D No:3
Page(s):
632-636
- HTML
- PDF(615KB) >> Buy this Article
- Errata[Uploaded on April 1,2019]
This paper introduces e-spill, an eager spill mechanism, which dynamically finds the optimal spill-threshold by monitoring the GC time at runtime and thereby prevent expensive GC overhead. Our e-spill adopts a slow-start model to gradually increase the spill-threshold until it reaches the optimal point without substantial GCs. We prototype e-spill as an extension to Spark and evaluate it using six workloads on three different parallel platforms. Our evaluations show that e-spill improves performance by up to 3.80× and saves the cost of cluster operation on Amazon EC2 cloud by up to 51% over the baseline system following Spark Tuning Guidelines.

Author Search Result

[Author] Tae Jun HAM(2hit)

Layerweaver+: A QoS-Aware Layer-Wise DNN Scheduler for Multi-Tenant Neural Processing Units

Eager Memory Management for In-Memory Data Analytics

Latest Issue

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles