Deep Papers is a podcast series featuring deep dives on today’s seminal AI papers and research. Hosted by Arize AI founders and engineers, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning.
Thoughtful discussions about current topics, moderated by American Banker editors.
Relevant, Inspirational, and Transformative. Explore Judaism with Rabbi Einhorn
Breaking Down Meta's Llama 3 Herd of Models
44:40
Meta just released Llama 3.1 405B–according to them, it’s “the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation.” Will the latest Llama herd ignite new applications and modeling paradigms like synthetic data gene…
Some banks are making a Faustian bargain with fintechs: Karen Petrou
18:57
Karen Petrou, the managing partner at Federal Financial Analytics and a long-time observer of banking and regulation, says banks need to do far more due diligence on potential fintech partners and exert more control over these relationships.
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines
33:57
Chaining language model (LM) calls as composable modules is fueling a new way of programming, but ensuring LMs adhere to important constraints requires heuristic “prompt engineering.” The paper this week introduces LM Assertions, a programming construct for expressing computational constraints that LMs should satisfy. The researchers integrated the…
What military members need from their banks
25:05
Two veterans and executives at Armed Forces Bank – Tom McLean and Jodi Vickery – share the challenges they see their customers face and new products the bank has rolled out this year to better serve them.
‘Regulators are wise to be more careful’ after Chevron ruling
47:07
Gene Scalia, the banking lobby’s lawyer on retainer for a potential challenge to Washington’s capital reform effort, discusses the state of administrative law after the overturning of a key legal precedent.
RAFT: Adapting Language Model to Domain Specific RAG
44:01
Where adapting LLMs to specialized domains is essential (e.g., recent news, enterprise private documents), we discuss a paper that asks how we adapt pre-trained LLMs for RAG in specialized domains. SallyAnn DeLucia is joined by Sai Kolasani, researcher at UC Berkeley’s RISE Lab (and Arize AI Intern), to talk about his work on RAFT: Adapting Languag…
Climate First Bank's plans to expand nationwide
30:17
The Florida bank has emerged from de novo status and can now offer its solar loans in a larger geographic footprint.
LLM Interpretability and Sparse Autoencoders: Research from OpenAI and Anthropic
44:00
It’s been an exciting couple weeks for GenAI! Join us as we discuss the latest research from OpenAI and Anthropic. We’re excited to chat about this significant step forward in understanding how LLMs work and the implications it has for deeper understanding of the neural activity of language models. We take a closer look at some recent research from…
Can data ownership be preserved in generative AI?
16:46
Foundational models like GPT-4, the large language model behind ChatGPT, have hoovered up content from publications like The New York Times and social media sites like Reddit, and OpenAI faces several lawsuits because of this. John Thompson, global head of artificial intelligence at EY and author of the book Data for All, has set up what is …
Trustworthy LLMs: A Survey and Guideline for Evaluating Large Language Models' Alignment
48:07
We break down the paper “Trustworthy LLMs: A Survey and Guideline for Evaluating Large Language Models' Alignment.” Ensuring alignment (that is, making models behave in accordance with human intentions) has become a critical task before deploying LLMs in real-world applications. However, a major challenge faced by practitioners is the lack of clear guid…
What might digital identity look like in the future?
21:30
Proof of identity is critical for many things, including being able to open a bank account, get a job, or obtain health care. Yet proving one’s identity is getting harder in a world of frequent data breaches. We asked Mariana Dahan, founder of the World Identity Network and chair of the Universal ID Council, what she thinks will solve this problem.…
Breaking Down EvalGen: Who Validates the Validators?
44:47
Due to the cumbersome nature of human evaluation and limitations of code-based evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in evaluating LLM outputs. Yet LLM-generated evaluators often inherit the problems of the LLMs they evaluate, requiring further human validation. This week’s paper explores EvalGen, a m…
“The law was very clear”: Inside the Fed master account debate
36:53
Custodia Founder and CEO Caitlin Long says the Federal Reserve has rewritten the rules around accessing the government's payments system. The central bank and a federal court judge disagree. Editor’s note: This conversation was recorded on April 17. On April 26, Custodia Bank filed a notice of appeal, signaling that it will challenge the district c…
Keys To Understanding ReAct: Synergizing Reasoning and Acting in Language Models
45:07
This week we explore ReAct, an approach that enhances the reasoning and decision-making capabilities of LLMs by combining step-by-step reasoning with the ability to take actions and gather information from external sources in a unified framework. To learn more about ML observability, join the Arize AI Slack community or get the latest on our Linked…
'There are risks': Betsy Cohen on banking as a service
17:17
Banking as a service is expensive, it takes time and onboarding has to be done carefully, says the founder of Bancorp Bank, who now runs a venture capital firm that invests in fintechs.
Cloud-based tech giants like Amazon, Google and Uber are changing the economy, and not for the better, asserts Yanis Varoufakis, a former finance minister of Greece and a professor at the University of Athens, who has written a book about the dangers of what he calls the "cloudalists."
Demystifying Chronos: Learning the Language of Time Series
44:40
This week, we’re covering Amazon’s time series model: Chronos. Developing accurate machine-learning-based forecasting models has traditionally required substantial dataset-specific tuning and model customization. Chronos, however, is built on a language model architecture and trained with billions of tokenized time series observations, enabling it t…
‘Not all fintech is good for people’: Jennifer Tescher
25:45
Financial technology startups have developed some useful technology for consumers, such as automated savings, says Tescher, who founded the Financial Health Network 20 years ago. But some fintech innovations are more questionable.
This week we dive into the latest buzz in the AI world – the arrival of Claude 3. Claude 3 is the newest family of models in the LLM space, and Claude 3 Opus (Anthropic's "most intelligent" Claude model) challenges the likes of GPT-4. The Claude 3 family of models, according to Anthropic, "sets new industry benchmarks," and includes "three state-o…
Reinforcement Learning in the Era of LLMs
44:49
We’re exploring Reinforcement Learning in the Era of LLMs this week with Claire Longo, Arize’s Head of Customer Success. Recent advancements in Large Language Models (LLMs) have garnered wide attention and led to successful products such as ChatGPT and GPT-4. Their proficiency in adhering to instructions and delivering harmless, helpful, and honest…
How banks are helping the fight against illegal wildlife trading
26:17
Geraldine Fleming, financial task force manager at United for Wildlife, and Jonny Bell, director, EMEA, LexisNexis Risk Solutions, explain how banks around the world are helping to catch criminals who illegally mutilate, kill and sell rhinoceroses, elephants, donkeys and other animals.
Sora: OpenAI’s Text-to-Video Generation Model
45:08
This week, we discuss the implications of Text-to-Video Generation and speculate as to the possibilities (and limitations) of this incredible technology with some hot takes. Dat Ngo, ML Solutions Engineer at Arize, is joined by community member and AI Engineer Vibhu Sapra to review OpenAI’s technical report on their Text-To-Video Generation Model: …
‘Don’t fall for the sales pitches’: Advice on deploying AI
25:38
Eric Siegel, author of the book The AI Playbook, explains what it takes to take traditional and advanced artificial intelligence projects from idea to execution.
This week, we’re discussing "RAG vs Fine-Tuning: Pipelines, Tradeoff, and a Case Study on Agriculture." This paper explores a pipeline for fine-tuning and RAG, and presents the tradeoffs of both for multiple popular LLMs, including Llama2-13B, GPT-3.5, and GPT-4. The authors propose a pipeline that consists of multiple stages, including extracting …
We dive into Phi-2 and some of the major differences and use cases for a small language model (SLM) versus an LLM. With only 2.7 billion parameters, Phi-2 surpasses the performance of Mistral and Llama-2 models at 7B and 13B parameters on various aggregated benchmarks. Notably, it achieves better performance compared to 25x larger Llama-2-70B model…
HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels
36:22
We discuss HyDE: a thrilling zero-shot learning technique that combines GPT-3’s language understanding with contrastive text encoders. HyDE revolutionizes information retrieval and grounding in real-world data by generating hypothetical documents from queries and retrieving similar real-world documents. It outperforms traditional unsupervised retri…
How challenger bank Upgrade grew during a dismal 2023
29:04
The neobank doubled its membership to five million consumers and hired 200 people last year. Founder and CEO Renaud Laplanche explains how it fared during a time when many fintechs struggled.
How generative AI could reshape financial services in 2024
17:02
Mike Abbott, global banking lead at Accenture, shared some of his predictions and opinions for the year ahead.
Has the fintech movement lived up to its promise?
34:43
The fintech revolution has been more successful at working with banks than at trying to replace them, points out Gene Ludwig, former Comptroller of the Currency, chair of the Ludwig Institute for Shared Economic Prosperity, and co-founder of Canapi Ventures. Those with “must have” products will fare far better in 2024 than those with “nice to have”…
A Deep Dive Into Generative's Newest Models: Gemini vs Mistral (Mixtral-8x7B)–Part I
47:50
For the last paper read of the year, Arize CPO & Co-Founder, Aparna Dhinakaran, is joined by Dat Ngo (ML Solutions Architect) and Aman Khan (Product Manager) for an exploration of the new kids on the block: Gemini and Mixtral-8x7B. There's a lot to cover, so this week's paper read is Part I in a series about Mixtral and Gemini. In Part I, we prov…
What to expect from VCs in 2024: Amy Nauiokas, Anthemis Group
28:06
It's been a rough year for fintechs and for the venture capital firms that fund them. Venture capital flows into financial technology companies dropped by 36% year over year to $6 billion in the third quarter of 2023. But Amy Nauiokas, founder and CEO of Anthemis Group, is optimistic about 2024.
How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings
44:59
We’re thrilled to be joined by Shuaichen Chang, LLM researcher and the author of this week’s paper to discuss his findings. Shuaichen’s research investigates the impact of prompt constructions on the performance of large language models (LLMs) in the text-to-SQL task, particularly focusing on zero-shot, single-domain, and cross-domain settings. Shu…
2023 was a rough year for bank regulators. What might 2024 bring?
17:08
Over the past year, the national bank regulators’ oversight of Silicon Valley Bank, Signature Bank, Silvergate Capital and other banks that failed has been criticized. Reports of a toxic workplace at the FDIC have come to light. And the OCC hired a Deputy Comptroller and overseer of fintech who had easily discoverable falsehoods on his resume. Mich…
The Geometry of Truth: Emergent Linear Structure in LLM Representation of True/False Datasets
41:02
For this paper read, we’re joined by Samuel Marks, Postdoctoral Research Associate at Northeastern University, to discuss his paper, “The Geometry of Truth: Emergent Linear Structure in LLM Representation of True/False Datasets.” Samuel and his team curated high-quality datasets of true/false statements and used them to study in detail the structur…
What fintechs think of the CFPB’s proposed data-sharing rule
30:50
Penny Lee, president and CEO of the Financial Technology Association, and Steve Boms, executive director of FDATA NA, explain what their members like about the proposed regulation and what they would change.
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
44:50
In this paper read, we discuss “Towards Monosemanticity: Decomposing Language Models Into Understandable Components,” a paper from Anthropic that addresses the challenge of understanding the inner workings of neural networks, drawing parallels with the complexity of human brain function. It explores the concept of “features,” (patterns of neuron ac…
How community banks can use tech to stay relevant
20:09
Community banks sometimes feel that they lack the budget and staff to compete with larger banks and fintechs on things like mobile and online banking, virtual assistants and, most recently, generative AI. Jim Perry, senior strategist at Market Insights, suggests steps they can and should take to stay relevant technology-wise.…
What might the Sam Bankman-Fried trial mean for banks?
17:12
The case is not really about cryptocurrency but about fraud, points out Seoyoung Kim, department chair and associate professor of finance and business analytics at the Leavey School of Business at Santa Clara University. But regulators and lawmakers are watching and the outcome of the trial will have repercussions throughout finance.…
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models
43:49
We discuss RankVicuna, the first fully open-source LLM capable of performing high-quality listwise reranking in a zero-shot setting. While researchers have successfully applied LLMs such as ChatGPT to reranking in an information retrieval context, such work has mostly been built on proprietary models hidden behind opaque API endpoints. This approac…
Explaining Grokking Through Circuit Efficiency
36:12
Join Arize Co-Founder & CEO Jason Lopatecki and ML Solutions Engineer Sally-Ann DeLucia as they discuss “Explaining Grokking Through Circuit Efficiency.” This paper explores novel predictions about grokking, providing significant evidence in favor of its explanation. Most strikingly, the research conducted in this paper demonstrates two novel an…
Citizens’ plans for using generative AI ethically: Beth Johnson
19:24
Johnson, chief experience officer at Citizens Financial Group, shares some of her concerns about advanced AI and plans to use it for purposes including contact center support and coding.
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior
42:14
In this episode, we discuss the paper, “Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior.” This episode is …
Why Ready Life CEO Ashley Bell is buying a Utah bank
25:51
Bell, an attorney, founded Ready Life to help reduce the racial wealth and homeownership gaps by showing lenders that credit-score-less consumers have been responsible with their money, based on their daily transactions. Now he and Bernice King, Martin Luther King Jr.'s daughter, are buying a community bank just outside of Salt Lake City.…
Are subprime cards a raw deal for people living paycheck to paycheck?
21:58
What are the best options out there for people living on the edge financially who have an emergency expense? The Global Black Economic Forum and the Center for Business and Economic Research recently completed a study called the 2023 Cash Poor Report that dives into this question.
Skeleton of Thought: LLMs Can Do Parallel Decoding
43:39
In this paper reading, we explore the ‘Skeleton-of-Thought’ (SoT) approach, aimed at reducing large language model latency while enhancing answer…
Partner Insights from ExtraHop: Catching cyber criminals in complicated banking networks
17:10
Financial institutions can trade billions of dollars per day and handle sensitive data for millions of customers across the globe, which make them enormously attractive targets for cybercriminals. Their defenses must be top notch and ever evolving to keep up with this threat, but FIs' infrastructures are usually vast and complex, straddling old, le…
Llama 2: Open Foundation and Fine-Tuned Chat Models
30:26
This episode is led by Aparna Dhinakaran (Chief Product Officer, Arize AI) and Michael Schiff (Chief Technology Officer, Arize AI), as they discuss th…
Lost in the Middle: How Language Models Use Long Contexts
42:28
This episode is led by Sally-Ann DeLucia and Amber Roberts, as they discuss the paper "Lost in the Middle: How Language Models Use Long Contexts." This…
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
42:03
Deep Papers is a podcast series featuring deep dives on today’s seminal AI papers and research. Hosted by AI Pub creator Brian Burns and Arize AI founders Jason Lopatecki and Aparna Dhinakaran, each episode profiles the people and techniques behind cutting-edge breakthroughs in machine learning. In this episode, we talk about Orca. Recent research …
The unintended consequences of banning ChatGPT at work
23:42
Though generative AI has limitations and risks, there is a cost to ignoring it, according to Ryan Favro, a managing principal at Capco.