Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
History
Music
About Us
Contact Us
Copyright
© 2024 PodJoint
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/fc/48/08/fc480827-6109-5bf6-c47a-c842949c6ef9/mza_17693176697459781715.jpg/600x600bb.jpg
Epikurious
Alejandro Santamaria Arza
15 episodes
3 days ago
Cravings of knowledge around tech, AI and the mind
Show more...
Tech News
News
RSS
All content for Epikurious is the property of Alejandro Santamaria Arza and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
Cravings of knowledge around tech, AI and the mind
Show more...
Tech News
News
https://d3t3ozftmdmh3i.cloudfront.net/staging/podcast_uploaded_nologo/42513579/42513579-1732431020227-e2bfc8a7a1b3a.jpg
From Training to Thinking: Optimizing AI for Real-World Challenges
Epikurious
15 minutes 35 seconds
1 year ago
From Training to Thinking: Optimizing AI for Real-World Challenges

Summary: This research paper explores how to optimally increase the computational resources used by large language models (LLMs) during inference, rather than solely focusing on increasing model size during training. The authors investigate two main strategies: refining the model's output iteratively (revisions) and employing improved search algorithms with a process-based verifier (PRM). They find that a "compute-optimal" approach, adapting the strategy based on prompt difficulty, significantly improves efficiency and can even outperform much larger models in certain scenarios. Their experiments using the MATH benchmark and PaLM 2 models show that test-time compute scaling can be a more effective alternative to increasing model parameters, especially for easier problems or those with lower inference token requirements. However, for extremely difficult problems, increased pre-training compute remains superior.

Epikurious
Cravings of knowledge around tech, AI and the mind