Article by David Yastremsky detailing Frankle and Carbin's 2019 Lottery Ticket Hypothesis paper, a motivating work for deep learning network sparsification.
The "lottery ticket hypothesis" shows empirical evidence that the performance of large deep learning models can be reproduced by smaller sub-networks within their architecture. Network pruning is therefore not only possible but can exist at the beginning of training in rare "lottery ticket" cases. Finding these sub-networks could help speed up training, reduce model size and overall help us understand deep learning better.
However, we still have not found a way to train small, efficient models from scratch. One key approach to model compression today relies on teacher-student networks, where a smaller student model learns to mimic the outputs of a pre-trained larger teacher. Many startups and labs are racing toward scalable generative AI, including Mistral AI and MosaicML (recently acquired by Databricks), led by none other than Jonathan Frankle himself.
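As a rough illustration of the teacher-student idea, the sketch below computes a standard knowledge-distillation loss in the style of Hinton et al. (2015). The `student`, `teacher`, temperature, and loss weighting are assumed placeholders, not details from the article:

```python
# Minimal knowledge distillation step: the student matches the teacher's
# softened output distribution in addition to the true labels.
import torch
import torch.nn.functional as F

def distillation_loss(student, teacher, x, y, T=2.0, alpha=0.5):
    with torch.no_grad():
        teacher_logits = teacher(x)  # soft targets from the frozen large model
    student_logits = student(x)
    # KL divergence between temperature-softened distributions, scaled by T^2.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard_loss = F.cross_entropy(student_logits, y)  # ordinary supervised loss
    return alpha * soft_loss + (1 - alpha) * hard_loss
```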