The Cloud-Edge AI Continuum
- Wednesday, April 26th 2023, 18:30 - 20:00
- Amazon Development Center Iași, Palas Campus
While deep ML models have mostly been designed and operated in the cloud, where enough computation power is available for their needs, we are starting to see a proliferation of ML models at the edge, on lower-power devices. Increasing privacy awareness and the need for lower latency are driving this trend of consuming data closer to its source.
Model compression and neural architecture search have been among the most common approaches for producing edge-ready ML models. But no matter how hard we try, producing compact models for the more resource-constrained devices will not be possible without compromising on accuracy.
In this talk, Valentin will focus on hybrid inference, which splits a model's computations between the edge and the cloud. User data privacy is preserved by running the model's early computations, the ones that touch raw data, only on the edge. The talk will explore solutions not only for inference but also for training across the edge and the cloud.
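To make the idea of split inference concrete, here is a minimal sketch in plain Python. It is not taken from the talk: the tiny three-layer network, its weights, the `SPLIT` index, and the helper names (`dense`, `forward`, `edge_infer`, `cloud_infer`) are all hypothetical, and a JSON string stands in for the network link between the device and the cloud. The point it illustrates is only that the raw input stays on the edge while intermediate activations are what get transmitted.

```python
import json

def dense(weights, biases, inputs):
    """One fully connected layer: weights is a list of rows, one per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]

def relu(values):
    return [max(0.0, v) for v in values]

# Hypothetical toy model: 4 inputs -> 3 hidden -> 2 hidden -> 2 outputs.
# Each entry is (weights, biases, apply_relu).
LAYERS = [
    ([[0.2, -0.1, 0.4, 0.0],
      [0.5, 0.3, -0.2, 0.1],
      [-0.3, 0.2, 0.1, 0.6]], [0.0, 0.1, -0.1], True),
    ([[0.7, -0.5, 0.2],
      [0.1, 0.4, -0.6]], [0.05, 0.0], True),
    ([[1.0, -1.0],
      [-0.5, 0.8]], [0.0, 0.2], False),
]

SPLIT = 1  # assumption: layers before this index run on the edge device

def forward(layers, x):
    """Run the given layers in order on input vector x."""
    for weights, biases, apply_relu in layers:
        x = dense(weights, biases, x)
        if apply_relu:
            x = relu(x)
    return x

def edge_infer(raw_input):
    """Edge side: sees the raw data, emits only intermediate activations."""
    activations = forward(LAYERS[:SPLIT], raw_input)
    return json.dumps(activations)  # stand-in for sending over the network

def cloud_infer(payload):
    """Cloud side: never sees raw data, finishes the computation."""
    activations = json.loads(payload)
    return forward(LAYERS[SPLIT:], activations)

raw_input = [0.9, 0.1, 0.4, 0.7]   # stays on the device
payload = edge_infer(raw_input)     # what actually leaves the device
result = cloud_infer(payload)       # final prediction computed in the cloud
print(result)
```

In this sketch the split output is identical to running the whole model in one place, so the choice of `SPLIT` only trades edge compute against the size of the transmitted activations. A real system would also have to weigh how much information about the input those activations still reveal.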