RepCNN: Micro-Sized, Mighty Models for Wakeword Detection


Always-on machine learning models require a very low memory and compute footprint. Their restricted parameter count limits both the model's capacity to learn and the effectiveness of standard training algorithms at finding the best parameters. Here we show that a small convolutional model can be better trained by first refactoring its computation into a larger, redundant multi-branched architecture. For inference, we then algebraically re-parameterize the trained model into its single-branched form, with fewer parameters and hence a lower memory footprint and compute cost. Using this technique, we show that our always-on wakeword detector model, RepCNN, provides a good trade-off between latency and accuracy during inference. The re-parameterized RepCNN is 43% more accurate than a uni-branch convolutional model with the same runtime. RepCNN also matches the accuracy of complex architectures like BC-ResNet while using 2x less peak memory and running 10x faster.
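
To make the re-parameterization step concrete, here is a minimal sketch in PyTorch. The branch layout shown (a 3x3 conv, a 1x1 conv, and an identity path summed together) is borrowed from RepVGG-style structural re-parameterization as an illustration, not RepCNN's exact architecture; `MultiBranchBlock` and `reparameterize` are hypothetical names.

```python
# Minimal sketch of structural re-parameterization (RepVGG-style branches
# assumed for illustration; not RepCNN's exact block design).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiBranchBlock(nn.Module):
    """Training-time block: three parallel branches summed together."""
    def __init__(self, channels: int):
        super().__init__()
        self.conv3x3 = nn.Conv2d(channels, channels, 3, padding=1, bias=True)
        self.conv1x1 = nn.Conv2d(channels, channels, 1, bias=True)

    def forward(self, x):
        # Redundant multi-branch computation, used only during training.
        return self.conv3x3(x) + self.conv1x1(x) + x

    def reparameterize(self) -> nn.Conv2d:
        """Algebraically fold all branches into one equivalent 3x3 conv."""
        c = self.conv3x3.out_channels
        # Zero-pad the 1x1 kernel to 3x3 so the kernels can be summed.
        w1x1 = F.pad(self.conv1x1.weight, [1, 1, 1, 1])
        # Express the identity branch as a 3x3 conv with a one-hot kernel.
        w_id = torch.zeros_like(self.conv3x3.weight)
        for i in range(c):
            w_id[i, i, 1, 1] = 1.0
        fused = nn.Conv2d(c, c, 3, padding=1, bias=True)
        with torch.no_grad():
            fused.weight.copy_(self.conv3x3.weight + w1x1 + w_id)
            fused.bias.copy_(self.conv3x3.bias + self.conv1x1.bias)
        return fused

# Sanity check: the fused single-branch conv matches the multi-branch output.
block = MultiBranchBlock(channels=8).eval()
x = torch.randn(1, 8, 32, 32)
fused = block.reparameterize()
assert torch.allclose(block(x), fused(x), atol=1e-5)
```

The fusion is exact because convolution is linear: summing the outputs of parallel branches is equivalent to a single convolution with the sum of their (suitably padded) kernels, so inference pays for only one branch.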


