सर्वश्रेष्ठ Twiml पॉडकास्टस (2024)

1
Localizing and Editing Knowledge in LLMs with Peter Hase - #679 49:46

8d ago49:46

49:46

Today we're joined by Peter Hase, a fifth-year PhD student at the University of North Carolina NLP lab. We discuss "scalable oversight", and the importance of developing a deeper understanding of how large neural networks make decisions. We learn how matrices are probed by interpretability researchers, and explore the two schools of thought regardi…

1
Coercing LLMs to Do and Reveal (Almost) Anything with Jonas Geiping - #678 48:27

15d ago48:27

48:27

Today we're joined by Jonas Geiping, a research group leader at the ELLIS Institute, to explore his paper: "Coercing LLMs to Do and Reveal (Almost) Anything". Jonas explains how neural networks can be exploited, highlighting the risk of deploying LLM agents that interact with the real world. We discuss the role of open models in enabling security r…

1
V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677 47:47

22d ago47:47

47:47

Today we’re joined by Mido Assran, a research scientist at Meta’s Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new model being billed as “the next step in Yann LeCun's vision” for true artificial reasoning. V-JEPA, the video version of Meta’s Joint Embedding Predictive Architecture, aims to bridge the gap between human…

1
Video as a Universal Interface for AI Reasoning with Sherry Yang - #676 49:34

29d ago49:34

49:34

Today we’re joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her new paper, "Video as the New Language for Real-World Decision Making,” which explores how generative video models can play a role similar to language models as a way to solve tasks in the real world. Sh…

1
Assessing the Risks of Open AI Models with Sayash Kapoor - #675 40:26

1M ago40:26

40:26

Today we’re joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his paper: "On the Societal Impact of Open Foundation Models.” We dig into the controversy around AI safety, the risks and benefits of releasing open model weights, and how we can establish common ground for as…

1
OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674 32:12

1M ago32:12

32:12

Today we’re joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source language model with 7 billion and 1 billion variants, but with a key difference compared to similar models offered by Meta, Mistral, and others. Namely, the fact that AI2 has also published the dataset …

1
Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673 25:03

2M ago25:03

25:03

Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation centers on Ben’s recent paper, “Why think step by step? Reasoning emerges from the locality of experience,” which he recently presented at NeurIPS 2023. In…

1
Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672 45:38

2M ago45:38

45:38

Today we're joined by Armineh Nourbakhsh of JP Morgan AI Research to discuss the development and capabilities of DocLLM, a layout-aware large language model for multimodal document understanding. Armineh provides a historical overview of the challenges of document AI and an introduction to the DocLLM model. Armineh explains how this model, distinct…

1
Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671 1:05:40

2M ago1:05:40

1:05:40

Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our NeurIPS 2024 series. In our conversation, Sanmi discusses his two recent award-winning papers. First, we dive into his paper, “Are Emergent Abilities of Large Language Models a Mirage?”. We discuss the different ways LLMs are evaluated and the excitement…

1
AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670 1:10:25

2M ago1:10:25

1:10:25

Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Trends 2024 series. In our conversation, Kamyar updates us on the latest developments in reinforcement learning (RL), and how the RL community is taking advantage of the abstract reasoning abilities of large language models (LLMs). Kamyar shares his insig…

1
Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669 35:29

3M ago35:29

35:29

Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval augmented generation (RAG). We explore the trade-offs between relying solely on LLMs for retrieval tasks versus combining retrieval in vector databases and LLMs, the advantages and complexities of RAG wit…

1
Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao - #668 39:45

3M ago39:45

39:45

Today we’re joined by Ben Zhao, a Neubauer professor of computer science at the University of Chicago. In our conversation, we explore his research at the intersection of security and generative AI. We focus on Ben’s recent Fawkes, Glaze, and Nightshade projects, which use “poisoning” approaches to provide users with security and protection against…

1
Learning Transformer Programs with Dan Friedman - #667 38:48

3M ago38:48

38:48

Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mechanistic interpretability for transformer models, specifically his paper, Learning Transformer Programs. The LTP paper proposes modifications to the transformer architecture which allow transformer mo…

1
AI Trends 2024: Machine Learning & Deep Learning with Thomas Dietterich - #666 1:05:18

3M ago1:05:18

1:05:18

Today we continue our AI Trends 2024 series with a conversation with Thomas Dietterich, distinguished professor emeritus at Oregon State University. As you might expect, Large Language Models figured prominently in our conversation, and we covered a vast array of papers and use cases exploring current research into topics such as monolithic vs. mod…

1
AI Trends 2024: Computer Vision with Naila Murray - #665 52:01

4M ago52:01

52:01

Today we kick off our AI Trends 2024 series with a conversation with Naila Murray, director of AI research at Meta. In our conversation with Naila, we dig into the latest trends and developments in the realm of computer vision. We explore advancements in the areas of controllable generation, visual programming, 3D Gaussian splatting, and multimodal…

1
Are Vector DBs the Future Data Platform for AI? with Ed Anuff - #664 48:13

4M ago48:13

48:13

Today we’re joined by Ed Anuff, chief product officer at DataStax. In our conversation, we discuss Ed’s insights on RAG, vector databases, embedding models, and more. We dig into the underpinnings of modern vector databases (like HNSW and DiskANN) that allow them to efficiently handle massive and unstructured data sets, and discuss how they help us…

1
Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663 46:49

4M ago46:49

46:49

Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick off our coverage of NeurIPS 2023. In our conversation with Markus, we cover his accepted papers at the conference, along with other work presented by Qualcomm AI Research scientists. Markus’ first paper, Quantizable Transformers: Removing Outliers by H…

1
Responsible AI in the Generative Era with Michael Kearns - #662 36:04

4M ago36:04

36:04

Today we’re joined by Michael Kearns, professor in the Department of Computer and Information Science at the University of Pennsylvania and an Amazon scholar. In our conversation with Michael, we discuss the new challenges to responsible AI brought about by the generative AI era. We explore Michael’s learnings and insights from the intersection of …

1
Edutainment for AI and AWS PartyRock with Mike Miller - #661 29:46

4M ago29:46

29:46

Today we’re joined by Mike Miller, director of product at AWS responsible for the company’s “edutainment” products. In our conversation with Mike, we explore AWS PartyRock, a no-code generative AI app builder that allows users to easily create fun and shareable AI applications by selecting a model, chaining prompts together, and linking different t…

1
Data, Systems and ML for Visual Understanding with Cody Coleman - #660 38:27

4M ago38:27

38:27

Today we’re joined by Cody Coleman, co-founder and CEO of Coactive AI. In our conversation with Cody, we discuss how Coactive has leveraged modern data, systems, and machine learning techniques to deliver its multimodal asset platform and visual search tools. Cody shares his expertise in the area of data-centric AI, and we dig into techniques like …

1
Patterns and Middleware for LLM Applications with Kyle Roche - #659 35:58

4M ago35:58

35:58

Today we’re joined by Kyle Roche, founder and CEO of Griptape to discuss patterns and middleware for LLM applications. We dive into the emerging patterns for developing LLM applications, such as off prompt data—which allows data retrieval without compromising the chain of thought within language models—and pipelines, which are sequential tasks that…

1
AI Access and Inclusivity as a Technical Challenge with Prem Natarajan - #658 41:46

4M ago41:46

41:46

Today we’re joined by Prem Natarajan, chief scientist and head of enterprise AI at Capital One. In our conversation, we discuss AI access and inclusivity as technical challenges and explore some of Prem and his team’s multidisciplinary approaches to tackling these complexities. We dive into the issues of bias, dealing with class imbalances, and the…

1
Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657 43:23

5M ago43:23

43:23

Today we’re joined by Jay Emery, director of technical sales & architecture at Microsoft Azure. In our conversation with Jay, we discuss the challenges faced by organizations when building LLM-based applications, and we explore some of the techniques they are using to overcome them. We dive into the concerns around security, data privacy, cost mana…

1
Visual Generative AI Ecosystem Challenges with Richard Zhang - #656 40:40

5M ago40:40

40:40

Today we’re joined by Richard Zhang, senior research scientist at Adobe Research. In our conversation with Richard, we explore the research challenges that arise when regarding visual generative AI from an ecosystem perspective, considering the disparate needs of creators, consumers, and contributors. We start with his work on perceptual metrics an…

1
Deploying Edge and Embedded AI Systems with Heather Gorr - #655 38:36

5M ago38:36

38:36

Today we’re joined by Heather Gorr, principal MATLAB product marketing manager at MathWorks. In our conversation with Heather, we discuss the deployment of AI models to hardware devices and embedded AI systems. We explore factors to consider during data preparation, model development, and ultimately deployment, to ensure a successful project. Facto…

1
AI Sentience, Agency and Catastrophic Risk with Yoshua Bengio - #654 48:00

5M ago48:00

48:00

Today we’re joined by Yoshua Bengio, professor at Université de Montréal. In our conversation with Yoshua, we discuss AI safety and the potentially catastrophic risks of its misuse. Yoshua highlights various risks and the dangers of AI being used to manipulate people, spread disinformation, cause harm, and further concentrate power in society. We d…

1
Delivering AI Systems in Highly Regulated Environments with Miriam Friedel - #653 44:05

6M ago44:05

44:05

Today we’re joined by Miriam Friedel, senior director of ML engineering at Capital One. In our conversation with Miriam, we discuss some of the challenges faced when delivering machine learning tools and systems in highly regulated enterprise environments, and some of the practices her teams have adopted to help them operate with greater speed and …

1
Mental Models for Advanced ChatGPT Prompting with Riley Goodside - #652 39:58

6M ago39:58

39:58

Today we’re joined by Riley Goodside, staff prompt engineer at Scale AI. In our conversation with Riley, we explore LLM capabilities and limitations, prompt engineering, and the mental models required to apply advanced prompting techniques. We dive deep into understanding LLM behavior, discussing the mechanism of autoregressive inference, comparing…

1
Multilingual LLMs and the Values Divide in AI with Sara Hooker - #651 1:18:39

6M ago1:18:39

1:18:39

Today we’re joined by Sara Hooker, director at Cohere and head of Cohere For AI, Cohere’s research lab. In our conversation with Sara, we explore some of the challenges with multilingual models like poor data quality and tokenization, and how they rely on data augmentation and preference training to address these bottlenecks. We also discuss the di…

1
Scaling Multi-Modal Generative AI with Luke Zettlemoyer - #650 38:44

6M ago38:44

38:44

Today we’re joined by Luke Zettlemoyer, professor at University of Washington and a research manager at Meta. In our conversation with Luke, we cover multimodal generative AI, the effect of data on models, and the significance of open source and open science. We explore the grounding problem, the need for visual grounding and embodiment in text-bas…

1
Pushing Back on AI Hype with Alex Hanna - #649 49:26

7M ago49:26

49:26

Today we’re joined by Alex Hanna, the Director of Research at the Distributed AI Research Institute (DAIR). In our conversation with Alex, we discuss the topic of AI hype and the importance of tackling the issues and impacts it has on society. Alex highlights how the hype cycle started, concerning use cases, incentives driving people towards the ra…

1
Personalization for Text-to-Image Generative AI with Nataniel Ruiz - #648 44:22

7M ago44:22

44:22

Today we’re joined by Nataniel Ruiz, a research scientist at Google. In our conversation with Nataniel, we discuss his recent work around personalization for text-to-image AI models. Specifically, we dig into DreamBooth, an algorithm that enables “subject-driven generation,” that is, the creation of personalized generative models using a small set …

1
Ensuring LLM Safety for Production Applications with Shreya Rajpal - #647 40:52

7M ago40:52

40:52

Today we’re joined by Shreya Rajpal, founder and CEO of Guardrails AI. In our conversation with Shreya, we discuss ensuring the safety and reliability of language models for production applications. We explore the risks and challenges associated with these models, including different types of hallucinations and other LLM failure modes. We also talk…

1
What’s Next in LLM Reasoning? with Roland Memisevic - #646 59:00

7M ago59:00

59:00

Today we’re joined by Roland Memisevic, a senior director at Qualcomm AI Research. In our conversation with Roland, we discuss the significance of language in humanlike AI systems and the advantages and limitations of autoregressive models like Transformers in building them. We cover the current and future role of recurrence in LLM reasoning and th…

1
Is ChatGPT Getting Worse? with James Zou - #645 42:17

8M ago42:17

42:17

Today we’re joined by James Zou, an assistant professor at Stanford University. In our conversation with James, we explore the differences in ChatGPT’s behavior over the last few months. We discuss the issues that can arise from inconsistencies in generative AI models, how he tested ChatGPT’s performance in various tasks, drawing comparisons betwee…

1
Why Deep Networks and Brains Learn Similar Features with Sophia Sanborn - #644 45:15

8M ago45:15

45:15

Today we’re joined by Sophia Sanborn, a postdoctoral scholar at the University of California, Santa Barbara. In our conversation with Sophia, we explore the concept of universality between neural representations and deep neural networks, and how these principles of efficiency provide an ability to find consistent features across networks and tasks.…

1
Inverse Reinforcement Learning Without RL with Gokul Swamy - #643 33:55

8M ago33:55

33:55

Today we’re joined by Gokul Swamy, a Ph.D. Student at the Robotics Institute at Carnegie Mellon University. In the final conversation of our ICML 2023 series, we sat down with Gokul to discuss his accepted papers at the event, leading off with “Inverse Reinforcement Learning without Reinforcement Learning.” In this paper, Gokul explores the challen…

1
Explainable AI for Biology and Medicine with Su-In Lee - #642 38:14

8M ago38:14

38:14

Today we’re joined by Su-In Lee, a professor at the Paul G. Allen School of Computer Science And Engineering at the University Of Washington. In our conversation, Su-In details her talk from the ICML 2023 Workshop on Computational Biology which focuses on developing explainable AI techniques for the computational biology and clinical medicine field…

1
Transformers On Large-Scale Graphs with Bayan Bruss - #641 38:36

8M ago38:36

38:36

Today we’re joined by Bayan Bruss, Vice President of Applied ML Research at Capital One. In our conversation with Bayan, we covered a pair of papers his team presented at this year’s ICML conference. We begin with the paper Interpretable Subspaces in Image Representations, where Bayan gives us a dive deep into the interpretability framework, embedd…

1
The Enterprise LLM Landscape with Atul Deo - #640 37:08

9M ago37:08

37:08

Today we’re joined by Atul Deo, General Manager of Amazon Bedrock. In our conversation with Atul, we discuss the process of training large language models in the enterprise, including the pain points of creating and training machine learning models, and the power of pre-trained models. We explore different approaches to how companies can leverage l…

1
BloombergGPT - an LLM for Finance with David Rosenberg - #639 36:52

9M ago36:52

36:52

Today we’re joined by David Rosenberg, head of the machine learning strategy team in the Office of the CTO at Bloomberg. In our conversation with David, we discuss the creation of BloombergGPT, a custom-built LLM focused on financial applications. We explore the model’s architecture, validation process, benchmarks, and its distinction from other la…

1
Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638 48:21

9M ago48:21

48:21

Today we’re joined by Robert Osazuwa Ness, a senior researcher at Microsoft Research, Professor at Northeastern University, and Founder of Altdeep.ai. In our conversation with Robert, we explore whether large language models, specifically GPT-3, 3.5, and 4, are good at causal reasoning. We discuss the benchmarks used to evaluate these models and th…

1
Privacy vs Fairness in Computer Vision with Alice Xiang - #637 37:41

9M ago37:41

37:41

Today we’re joined by Alice Xiang, Lead Research Scientist at Sony AI, and Global Head of AI Ethics at Sony Group Corporation. In our conversation with Alice, we discuss the ongoing debate between privacy and fairness in computer vision, diving into the impact of data privacy laws on the AI space while highlighting concerns about unauthorized use a…

1
Unifying Vision and Language Models with Mohit Bansal - #636 48:08

10M ago48:08

48:08

Today we're joined by Mohit Bansal, Parker Professor, and Director of the MURGe-Lab at UNC, Chapel Hill. In our conversation with Mohit, we explore the concept of unification in AI models, highlighting the advantages of shared knowledge and efficiency. He addresses the challenges of evaluation in generative AI, including biases and spurious correla…

1
Data Augmentation and Optimized Architectures for Computer Vision with Fatih Porikli - #635 52:31

10M ago52:31

52:31

Today we kick off our coverage of the 2023 CVPR conference joined by Fatih Porikli, a Senior Director of Technology at Qualcomm. In our conversation with Fatih, we covered quite a bit of ground, touching on a total of 12 papers/demos, focusing on topics like data augmentation and optimized architectures for computer vision. We explore advances in o…

1
Mojo: A Supercharged Python for AI with Chris Lattner - #634 57:22

10M ago57:22

57:22

Today we’re joined by Chris Lattner, Co-Founder and CEO of Modular. In our conversation with Chris, we discuss Mojo, a new programming language for AI developers. Mojo is unique in this space and simplifies things by making the entire stack accessible and understandable to people who are not compiler engineers. It also offers Python programmers the…

1
Stable Diffusion and LLMs at the Edge with Jilei Hou - #633 40:09

10M ago40:09

40:09

Today we’re joined by Jilei Hou, a VP of Engineering at Qualcomm Technologies. In our conversation with Jilei, we focus on the emergence of generative AI, and how they've worked towards providing these models for use on edge devices. We explore how the distribution of models on devices can help amortize large models' costs while improving reliabili…

1
Modeling Human Behavior with Generative Agents with Joon Sung Park - #632 46:38

11M ago46:38

46:38

Today we’re joined by Joon Sung Park, a PhD Student at Stanford University. Joon shares his passion for creating AI systems that can solve human problems and his work on the recent paper Generative Agents: Interactive Simulacra of Human Behavior, which showcases generative agents that exhibit believable human behavior. We discuss using empirical me…

1
Towards Improved Transfer Learning with Hugo Larochelle - #631 38:52

11M ago38:52

38:52

Today we’re joined by Hugo Larochelle, a research scientist at Google Deepmind. In our conversation with Hugo, we discuss his work on transfer learning, understanding the capabilities of deep learning models, and creating the Transactions on Machine Learning Research journal. We explore the use of large language models in NLP, prompting, and zero-s…

1
Language Modeling With State Space Models with Dan Fu - #630 28:15

11M ago28:15

28:15

Today we’re joined by Dan Fu, a PhD student at Stanford University. In our conversation with Dan, we discuss the limitations of state space models in language modeling and the search for alternative building blocks that can help increase context length without being computationally infeasible. Dan walks us through the H3 architecture and Flash Atte…

पॉडकास्ट सुनने लायक

Twiml पॉडकास्टस

पॉडकास्ट सुनने लायक

त्वरित संदर्भ मार्गदर्शिका