Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
Manage episode 442378268 series 3524393
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks.
https://arxiv.org/abs//2409.17115
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1687 tập
Manage episode 442378268 series 3524393
The paper introduces PROX, a framework enabling small language models to refine data effectively, outperforming human-crafted methods and enhancing efficiency in LLM pre-training across various benchmarks.
https://arxiv.org/abs//2409.17115
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--- Support this podcast: https://podcasters.spotify.com/pod/show/arxiv-papers/support
1687 tập
Todos os episódios
×Chào mừng bạn đến với Player FM!
Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.