Authors
Seungyeop Han, Haichen Shen, Matthai Philipose, Sharad Agarwal, Alec Wolman, Arvind Krishnamurthy
Publication date
2016/6/20
Book
Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services
Pages
123-136
Description
We consider applying computer vision to video on cloud-backed mobile devices using Deep Neural Networks (DNNs). The computational demands of DNNs are high enough that, without careful resource management, such applications strain device battery, wireless data, and cloud cost budgets. We pose the corresponding resource management problem, which we call Approximate Model Scheduling, as one of serving a stream of heterogeneous (i.e., solving multiple classification problems) requests under resource constraints. We present the design and implementation of an optimizing compiler and runtime scheduler to address this problem. Going beyond traditional resource allocators, we allow each request to be served approximately, by systematically trading off DNN classification accuracy for resource use, and remotely, by reasoning about on-device/cloud execution trade-offs. To inform the resource …
Total citations
20162017201820192020202120222023202432252606487795216
Scholar articles
S Han, H Shen, M Philipose, S Agarwal, A Wolman… - Proceedings of the 14th Annual International …, 2016