Chuyển sang chế độ ngoại tuyến với ứng dụng Player FM !
Podcast đáng để nghe
TÀI TRỢ BỞI


1 Family Secrets: Chris Pratt & Millie Bobby Brown Share Stories From Set 22:08
Is Apache Spark Too Costly? An Amazon Engineer Tells His Story
Manage episode 451267089 series 2574278
Is Apache Spark too costly? Amazon Principal Engineer Patrick Ames tackled this question during an interview with The New Stack Makers, sharing insights into transitioning from Spark to Ray for managing large-scale data. Ames, described as a "go-to" engineer for exabyte-scale projects, emphasized a goal-driven approach to solving complex engineering problems, from simplifying daily chores to optimizing software solutions.
Initially, Spark was chosen at Amazon for its simplicity and open-source flexibility, allowing efficient merging of data with minimal SQL code. The team leveraged Spark in a decoupled architecture over S3 storage, scaling it to handle thousands of jobs daily. However, as data volumes grew to hundreds of terabytes and beyond, Spark’s limitations became apparent. Long processing times and high costs prompted a search for alternatives.
Enter Ray—a unified framework designed for scaling AI and Python applications. After experimentation, Ames and his team noted significant efficiency improvements, driving the shift from Spark to Ray to meet scalability and cost-efficiency needs.
Learn more from The New Stack about Apache Spark and Ray:
Amazon to Save Millions Moving From Apache Spark to Ray
How Ray, a Distributed AI Framework, Helps Power ChatGPT
Join our community of newsletter subscribers to stay on top of the news and at the top of your game.
301 tập
Manage episode 451267089 series 2574278
Is Apache Spark too costly? Amazon Principal Engineer Patrick Ames tackled this question during an interview with The New Stack Makers, sharing insights into transitioning from Spark to Ray for managing large-scale data. Ames, described as a "go-to" engineer for exabyte-scale projects, emphasized a goal-driven approach to solving complex engineering problems, from simplifying daily chores to optimizing software solutions.
Initially, Spark was chosen at Amazon for its simplicity and open-source flexibility, allowing efficient merging of data with minimal SQL code. The team leveraged Spark in a decoupled architecture over S3 storage, scaling it to handle thousands of jobs daily. However, as data volumes grew to hundreds of terabytes and beyond, Spark’s limitations became apparent. Long processing times and high costs prompted a search for alternatives.
Enter Ray—a unified framework designed for scaling AI and Python applications. After experimentation, Ames and his team noted significant efficiency improvements, driving the shift from Spark to Ray to meet scalability and cost-efficiency needs.
Learn more from The New Stack about Apache Spark and Ray:
Amazon to Save Millions Moving From Apache Spark to Ray
How Ray, a Distributed AI Framework, Helps Power ChatGPT
Join our community of newsletter subscribers to stay on top of the news and at the top of your game.
301 tập
All episodes
×

1 Kong’s AI Gateway Aims to Make Building with AI Easier 21:05


1 What’s the Future of Platform Engineering? 26:44


1 AI Agents are Dumb Robots, Calling LLMs 28:31




1 How Generative AI Is Reshaping the SDLC 21:42


1 OAuth Works for AI Agents but Scaling is Another Question 25:36


1 LLMs and AI Agents Evolving Like Programming Languages 28:08


1 Writing Code About Your Infrastructure? That's a Losing Race 31:21


1 OpenTelemetry: What’s New with the 2nd Biggest CNCF Project? 30:14


1 What’s Driving the Rising Cost of Observability? 24:55


1 How Oracle Is Meeting the Infrastructure Needs of AI 27:28


1 Arm: See a Demo About Migrating a x86-Based App to ARM64 21:28


1 Heroku Moved Twelve-Factor Apps to Open Source. What’s Next? 22:54


1 How Falco Brought Real-Time Observability to Infrastructure 19:27


1 How cert-manager Got to 500 Million Downloads a Month 23:18
Chào mừng bạn đến với Player FM!
Player FM đang quét trang web để tìm các podcast chất lượng cao cho bạn thưởng thức ngay bây giờ. Đây là ứng dụng podcast tốt nhất và hoạt động trên Android, iPhone và web. Đăng ký để đồng bộ các theo dõi trên tất cả thiết bị.