Content provided by Adam Bien. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Adam Bien or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://vi.player.fm/legal.

Exploring ONNX, Embedding Models, and Retrieval Augmented Generation (RAG) with Langchain4j

1:09:00
 
An airhacks.fm conversation with Dmytro Liubarskyi (@langchain4j) about:
Dmytro previously on "#285 How LangChain4j Happened", the ONNX format and runtime for running neural network models in Java, using the LangChain4j library for seamless integration and data handling, embedding models for converting text into vector representations, strategies for handling longer text inputs by splitting and averaging embeddings, overview of the retrieval augmented generation (RAG) pipeline and its components, using embeddings for query transformation, routing, and data source selection in RAG, integrating LangChain4j with Quarkus and CDI for building AI-powered applications, LangChain4j provides pre-packaged ONNX models as Maven dependencies, embedding models are faster and smaller than full language models, possibilities of using embeddings for query expansion, summarization, and data source selection, cross-checking model outputs using embeddings or another language model, decomposing complex AI services into smaller, specialized sub-modules, injecting the right tools and data based on query classification

Dmytro Liubarskyi on twitter: @langchain4j
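
The "splitting and averaging embeddings" strategy mentioned in the episode notes can be sketched in plain Java. This is only an illustration under stated assumptions: the class `AveragedEmbedding`, its methods, and the toy embedder are hypothetical names, not the LangChain4j API, and the naive character-based splitting stands in for whatever segmentation a real pipeline would use. A real setup might also weight chunks by length or re-normalize the result; the notes do not say.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Hypothetical helper, not part of LangChain4j: embeds a long text by
// splitting it into chunks, embedding each chunk, and averaging the vectors.
class AveragedEmbedding {

    // Naive split into chunks of at most maxChars characters (assumption:
    // a real pipeline would split on sentences or tokens instead).
    public static List<String> split(String text, int maxChars) {
        List<String> chunks = new ArrayList<>();
        for (int start = 0; start < text.length(); start += maxChars) {
            chunks.add(text.substring(start, Math.min(text.length(), start + maxChars)));
        }
        return chunks;
    }

    // Element-wise mean of equally sized vectors.
    public static float[] average(List<float[]> vectors) {
        if (vectors.isEmpty()) {
            throw new IllegalArgumentException("no vectors to average");
        }
        int dim = vectors.get(0).length;
        float[] mean = new float[dim];
        for (float[] v : vectors) {
            if (v.length != dim) {
                throw new IllegalArgumentException("vectors differ in dimension");
            }
            for (int i = 0; i < dim; i++) {
                mean[i] += v[i];
            }
        }
        for (int i = 0; i < dim; i++) {
            mean[i] /= vectors.size();
        }
        return mean;
    }

    // Splits the text, embeds every chunk with the supplied function, and
    // returns the averaged vector. Throws for empty text (no chunks).
    public static float[] embedLongText(String text, int maxChars,
                                        Function<String, float[]> embedder) {
        List<float[]> vectors = new ArrayList<>();
        for (String chunk : split(text, maxChars)) {
            vectors.add(embedder.apply(chunk));
        }
        return average(vectors);
    }

    public static void main(String[] args) {
        // Toy stand-in for an embedding model (assumption): maps a chunk to
        // [chunk length, number of spaces]. A real model returns dense vectors.
        Function<String, float[]> toyEmbedder = s ->
                new float[]{s.length(), (float) s.chars().filter(c -> c == ' ').count()};

        float[] vector = embedLongText("a long text that exceeds the model input limit", 16, toyEmbedder);
        System.out.println(java.util.Arrays.toString(vector));
    }
}
```

The embedder is passed in as a `Function<String, float[]>` so the averaging logic stays independent of any particular model or library.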


341 episodes


All episodes

High-Performance Load Testing 1:10:10

 
An airhacks.fm conversation with Francesco Nigro (@forked_franz) about: discussion about the importance of stress testing over system tests and unit tests, the Coordinated Omission Problem in load generators where they don't accurately measure server performance during slowdowns, introduction to HyperFoil as a high-performance load generator capable of generating millions of requests per second with just two cores, explanation of how HyperFoil avoids GC overhead by pre-allocating resources, the architecture of HyperFoil using Netty event loops and a graph-based execution model, comparison with other load testing tools like JMeter, K6, Apache Benchmark and Vegeta, introduction to QDUP as a shell automation tool for distributed testing, overview of Horreum for performance test results storage and analysis, explanation of how these tools work together in Red Hat's performance testing pipeline, discussion of JCTools and its importance for GC-free concurrent data structures, the Universal Scalability Law and its application to load balancing algorithms, the pick-two-random algorithm for efficient resource allocation, the benefits of using JBang for easy one-line execution of HyperFoil, potential drawbacks of HyperFoil including ergonomics and JIT compilation warm-up issues, the possibility of using GraalVM native image to avoid JIT compilation delays

Francesco Nigro on twitter: @forked_franz…
 
An airhacks.fm conversation with Burr Sutter (@burrsutter) about: discussion about integrating LLMs into enterprise Java applications, challenges with non-deterministic LLM outputs in deterministic code environments, limitations of chat interfaces for power users in enterprise settings, preference for form-based applications with prompts running behind the scenes, using LLMs to understand unstructured data while providing structured interfaces, maintaining existing CRUD systems while using LLMs for unstructured data like emails and support tickets, practical examples of using LLMs to generate code from business requirements, creating assistants with system messages and short user prompts, potential for embeddings to replace text prompts in the future, developer journey in learning LLM integration including prompts, tools, RAG, and agentic workflows, benefits of specialized agents over one general agent, using LLMs for code generation with limitations for complex use cases, hybrid approaches combining LLMs with human oversight, using LLMs for email routing and support case classification, potential for extracting knowledge from enterprise data sources like Confluence and SharePoint, quality assurance with LLM judges, discussion of small language models versus large ones, model distillation and fine-tuning for specific enterprise use cases, cost considerations for model training versus using off-the-shelf models with better tool invocation, prediction that models will become more efficient and run on commodity hardware in the future, focus on post-training inference and reliable results

Burr Sutter on twitter: @burrsutter…
 
An airhacks.fm conversation with Volker Simonis (@volker_simonis) about: early computing experiences with Schneider CPC (Amstrad in UK) with Z80 CPU, CP/M operating system as an add-on that provided a real file system, programming in Basic and Turbo Pascal on early computers, discussion about gaming versus programming interests, using a 9-pin needle printer for school work, programming on pocket computers with BASIC in school, memories of Digital Research's CP/M and DR-DOS competing with MS-DOS, HiMEM memory management in early operating systems, programming in Logo language with turtle graphics and fractals, fascination with Lindenmayer systems (L-systems) for simulating biological growth patterns, interest in biology and carnivorous plants, transition to PCs with floppy disk drives, using SGI Iris workstations at university with IRIX operating system, early experiences with Linux installed from floppy disks, challenges of configuring X Window System, programming graphics on interlaced monitors, early work with HP using Tcl/Tk and Python around 1993, first experiences with Java around version 0.8/0.9, attraction to Java's platform-independent networking and graphics capabilities, using Blackdown Java for Linux created by Johan Vos, freelance work creating Java applets for accessing databases of technical standards, PhD work creating software for analyzing parallel text corpora in multiple languages, developing internationalization and XML capabilities in Java Swing applications, career at Sun Microsystems porting MaxDB to Solaris, transition to SAP to work on JVM development, Adabas and MaxDB, reflections on ABAP programming language at SAP and its database-centric nature

Volker Simonis on twitter: @volker_simonis…
 
The Database Cloud 1:09:03

 
An airhacks.fm conversation with Alvaro Hernandez (@ahachete) about: discussion about StackGres as a complete database cloud solution for PostgreSQL, Kubernetes as an abstraction layer over infrastructure providing a programmable API, StackGres offering high availability with primary and replica nodes using Patroni, integrated connection pooling with PgBouncer, Kubernetes operators and Custom Resource Definitions (CRDs) as a powerful way to extend Kubernetes, day two operations automated through CRDs including benchmarks and version upgrades, StackGres supporting sharding with Citus for horizontal scaling similar to DynamoDB, Change Data Capture capabilities using embedded Debezium, failover mechanisms taking typically 30 seconds with DNS updates, synchronous vs asynchronous replication options affecting data loss during failover, StackGres being implemented in Java using Quarkus, containerd as a programmable container runtime that can be used without Kubernetes, StackGres offering multiple interfaces including CRDs, REST API, and a web console, considerations for running databases on Kubernetes vs cloud-managed services, the advantages of containerization for infrastructure, the challenges of multi-leader setups in PostgreSQL requiring conflict resolution, the value of Kubernetes for on-premises deployments vs cloud environments

Alvaro Hernandez on twitter: @ahachete…
 
From OCCAM and CSP to Java 1:09:08

 
An airhacks.fm conversation with Kevlin Henney (@KevlinHenney) about: first computer was a Sinclair ZX81 with 1K of memory, programming in Basic and later Z80 assembly language, creating simulations like volcano explosions and n-body problems as a teenager, transitioning to the ZX Spectrum and other early home computers, studying physics at university but becoming more interested in programming, learning Fortran, getting his first programming job at a small software house in Bristol where his boss had a wall of books on programming languages and paradigms, becoming self-taught through reading these books, developing an interest in AI and philosophy of mind which led to pursuing a master's degree in parallel computer systems, creating a virtual machine for a Lisp-based actor model, learning about the occam programming language based on Communicating Sequential Processes (CSP) for transputers, discovering object-oriented programming and being fascinated by modularity and encapsulation, encountering Java in the mid-90s as a free downloadable language with platform independence, appreciating Java's familiar C-like syntax while offering object orientation without low-level concerns, using Java primarily for training and consultancy work rather than application development, discussing the evolution of Java features like inner classes (Java 1.1) and interfaces, explaining his unique perspective on interfaces coming from distributed systems experience with IDLs, reflecting on his work with various distributed Java technologies like RMI, Jini and JavaSpaces, continuing his career as an independent consultant, trainer and speaker with strong involvement in the patterns community

Kevlin Henney on twitter: @KevlinHenney…
 
 
An airhacks.fm conversation with Francesco Nigro (@forked_franz) about: Netty committer and performance engineer at Red Hat, discussion of Netty's history, focus on low-level core components like buffers and allocators in Netty, relationship between Vert.x and Netty where Vert.x provides a more opinionated and user-friendly abstraction over Netty, explanation of reactive back pressure implementation in Vert.x, performance advantages of Vert.x over Netty due to batching and reactive design, detailed explanation of io_uring as a Linux-specific asynchronous I/O mechanism, comparison between event loop architecture and Project Loom for scalability, limitations of Loom when working with io_uring due to design incompatibilities, discovery of a major Java type system scalability issue related to instanceof checks against interfaces, explanation of how this issue affected Hibernate performance, deep investigation using assembly-level analysis to identify the root cause, collaboration with Andrew Haley to fix the 20-year-old JDK issue, performance improvements of 2-3x after fixing the issue, discussion of CPU cache coherency problems in NUMA architectures, explanation of how container environments like Kubernetes can worsen performance issues due to CPU scheduling, insights into how modern CPUs handle branch prediction and speculation, impact of branch misprediction on performance especially with memory access patterns, discussion of memory bandwidth limitations in AI/ML workloads, advantages of unified memory architectures like Apple M-series chips for AI inference

Francesco Nigro on twitter: @forked_franz…
 
From Kona Coffee Beans to Java 1:15:45

 
An airhacks.fm conversation with Burr Sutter (@burrsutter) about: first computer: IBM PS/2 386SX funded by grandparents' Kona coffee sales, early passion for programming and problem-solving, self-taught C programming, database engine development as a student, transition from theater aspirations to computer science, work with Progress 4GL and SilverStream, shift to .NET development, joining JBoss and Red Hat through acquisition, Marc Fleury's impactful "free don't suck" presentation, evolution of Java application servers and middleware technologies, enterprise service bus and SOA, impact of Docker and Kubernetes on the industry, Red Hat's adaptation to cloud-native technologies, development of Quarkus, current interest in language models and GenAI, Java's longevity and adaptability, Quarkus' fast startup time and compatibility with legacy Java EE applications, work on Kubernetes and Quarkus, the importance of Java's "write once, run anywhere" principle, Java's performance compared to other languages

Burr Sutter on twitter: @burrsutter…
 
Pure Java Inception 1:03:08

 
An airhacks.fm conversation with Christian Humer (@grashalm_) about: early programming experiences with DOS text adventures and Captain Comic, transition from graphics design to computer science, work on Java Server Pages (JSPs) and point-of-sale systems, development of Swing GUI for touchscreens, introduction to GraalVM and Truffle framework, ActionScript, Adobe Flash and Adobe Flex, explanation of Futamura projections and partial evaluation in Truffle, discussion on the challenges of implementing dynamic language runtimes, de-optimization in JIT compilers, Nashorn JavaScript engine vs. GraalJS, language interoperability in GraalVM, reuse of libraries across different programming languages, embedding of JavaScript and React in Java applications, comparison with PyPy in the Python ecosystem, current work on bytecode DSL for generating bytecode interpreters, the importance of math in computer science and its relation to programming concepts

Christian Humer on twitter: @grashalm_…
 
From Swing to Blockchain 1:01:36

 
An airhacks.fm conversation with Richard Bair (@RichardBair) about: early programming experiences with Basic and building computers, his first production app at 17 for his father's auto repair shop, starting computer science degree at 16 and completing it at 43, joining Sun Microsystems' Swing team, working on JavaFX from its inception as F3 through its evolution, becoming lead of JavaFX team and chief architect for client Java, moving to Oracle's IoT team, current role as VP of Engineering at Hedera blockchain company, explanation of Hedera's Hashgraph algorithm solving the Byzantine Generals Problem, implementation of Hedera's technology in Java, open-source nature of Hedera's codebase, resources for learning about and developing with Hedera including documentation, Hedera Improvement Proposals (HIPs), and the Hedera developer playground, discussion of blockchain technology and its potential impact on open protocols and decentralized networks, comparison of blockchain to distributed databases, explanation of consensus mechanisms in distributed ledgers, tokenization of real-world assets

Richard Bair on twitter: @RichardBair…
 
An airhacks.fm conversation with Alvaro Hernandez (@ahachete) about: discussion on Postgres JDBC driver contributions, SCRAM authentication library implementation, importance of connection pooling for Postgres performance, tuning Postgres configuration, PgBouncer as a popular connection pooler, challenges with lambda and database connections, benefits of using connection poolers at multiple levels, the need for an HTTP-based protocol for PostgreSQL, PostgreSQL Configuration Tool by StackGres, distributed SQL databases like DSQL and their trade-offs, optimistic vs pessimistic locking in distributed databases, comparison of JPA optimistic locking to distributed database conflicts, the power of using SQL directly vs ORM frameworks, the evolution of Java and JDBC making direct database queries more convenient, the potential benefits of using stored procedures in databases, the importance of understanding database internals for optimal performance, the need for careful consideration when choosing between ORM and direct SQL queries, the complexities of distributed databases and their impact on application design

Alvaro Hernandez on twitter: @ahachete…
 
Natural Born Breaker 1:22:40

 
An airhacks.fm conversation with Francesco Nigro (@forked_franz) about: starting with a used Commodore 64 without display, breakdancing as a hobby and its influence on his learning approach, studying computer science at university with a focus on AI and compilers, pursuing a PhD in reinforcement learning, transitioning to IoT and embedded system work, discovering high-performance computing and concurrency patterns like the Disruptor, contributing to open-source projects, persistence in joining Red Hat despite initial rejection, rewriting ActiveMQ Artemis journal, considering Hazelcast before ultimately choosing Red Hat, working on messaging and performance optimization at Red Hat, becoming the performance expert for Quarkus, journey from assembly and C programming to Java performance optimization, the importance of understanding low-level details in high-level languages, the impact of container resources on JVM performance, the value of deep technical knowledge in the age of AI and LLMs, Francesco's current role at Red Hat focusing on Quarkus performance and scalability issues

Francesco Nigro on twitter: @forked_franz…
 
Just Another CDI Committer 1:02:23

 
An airhacks.fm conversation with Ladislav Thon (@ladicek) about: CDI history and evolution, transition from XML-based configuration to annotation-based dependency injection, introduction of CDI Lite in version 4.0, differences between portable extensions and build-compatible extensions, ArC as Quarkus CDI implementation, challenges in implementing CDI at build time, new features in CDI 4.0 and 4.1 including lifecycle events and method invokers, comparison of CDI with other dependency injection frameworks, discussion on decorators, interceptors, and stereotypes in CDI, performance implications of CDI in Quarkus, Convention over Configuration in CDI, upcoming changes in CDI 5, removal of expression language dependency from CDI API, benefits of build-time oriented implementations like Quarkus, challenges in migrating portable extensions to build-compatible extensions, introduction of synthetic beans and observers, addition of priority support for stereotypes, improvements in invocation context API, ability to declare priority on producers in CDI 4.1, integration of CDI with application programming models, Convention over Configuration paired with dependency injection, performance considerations of CDI in Quarkus compared to manual dependency management

Ladislav Thon on twitter: @ladicek…
 
An airhacks.fm conversation with Gerald Venzl (@GeraldVenzl) about: discussion on prepared statements and their benefits in Oracle databases, explanation of hard parsing vs soft parsing in database queries, overview of connection pooling and its importance in database performance, introduction to Oracle's Database Resident Connection Pool (DRCP), exploration of Oracle's support for serverless workloads, discussion on PL/SQL and JavaScript support in Oracle databases, brief mention of the Ada programming language and its influence on PL/SQL, introduction to GraalVM and its role in Oracle databases, comparison of performance between PL/SQL and JavaScript in Oracle, mention of Oracle database support for ARM architecture including M1 Macs and Raspberry Pi 5, explanation of database sharding vs partitioning, discussion on the benefits of stored procedures for data-intensive operations

Gerald Venzl on twitter: @GeraldVenzl…
 
An airhacks.fm conversation with Alvaro Hernandez (@ahachete) about: first computer experiences with Amstrad CPC 464, early programming with Basic, university studies in Telecommunication Engineering, transition from PHP to Java development, creating an ERP system, attending JavaOne conferences, failed startup attempt with a mobile phone bill analysis app, specialization in PostgreSQL, founding the company OnGres, developing StackGres as a Kubernetes operator for PostgreSQL, discussion about the benefits of open-source software and the "My Server, My Rules" philosophy, comparison of cloud-managed services vs. self-managed solutions, the importance of control and transparency in database management, StackGres as a solution for running PostgreSQL as a service with full control, the use of Java in developing Kubernetes operators, the shift from on-premises to cloud deployments and its implications for developers, the challenges of setting up and managing databases in the cloud, the benefits of serverless and managed services, the importance of understanding the underlying infrastructure in cloud deployments, the evolution of database management from dedicated teams to self-service models, the potential for new container-related products from OnGres, the recent popularity of Bluesky as a social media platform for the Java community

Alvaro Hernandez on twitter: @ahachete…
 
An airhacks.fm conversation with Ladislav Thon (@ladicek) about: Didaktik Gama to Red Hat, early programming experiences with Basic and Karel, learning Pascal and C in school, working with Java in university and early career, joining Red Hat as a quality engineer for JBoss Enterprise Application Platform, testing clustering and load balancing with SmartFrog, transitioning to WildFly Swarm / Thorntail development, becoming Thorntail project lead, moving to Quarkus development, involvement in CDI specification improvements, discussion about portable extensions in CDI, interest in science fiction literature, mention of favorite authors and books including Neal Stephenson's Anathem and The Expanse series

Ladislav Thon on github: @ladicek…
 