Voice AI Will Change How We Bank

Voice AI Will Change How We Bank

By James da Costa and Angela Strange

When Waymo launched in San Francisco in August 2023, few bystanders expected it to take a significant portion of the ride-sharing market. Fifteen months later, Waymo represents 22% of all SF ride-shares — equal to that of 13-year-old Lyft. Similarly, many industry players didn’t initially expect autonomous agents, let alone autonomous voice agents, to gain traction in banking. Yet, just like with Waymo, we anticipate that before long, we will begin to see a world where businesses and consumers can sit back and have their finances run on autopilot.

Financial services make up 25% of total spend on all global contact centers and over $100 billion in annual business process outsourcing (BPO) spend, whether it be verifying consumers with voice-based biometric authentication or managing complex customer issues via call centers. Even before the launch of ChatGPT, many banks were experimenting with voice automation: Bank of America's voice agent Erica had 1 billion interactions by 2022.

Voice AI has not only reached par with humans; in some cases, it's better

As our colleagues have written, voice AI is at an exciting inflection point. Voice agents allow you to scale to peak demand, staff phones 24/7, communicate in your customer’s preferred language, and operate at a fraction of the cost of human employees. Consider an immigrant from the Philippines with an auto loan who is working 9 a.m to 5 p.m. every weekday. Now, instead of needing to take time off work to get her questions answered, she can converse with an infinitely patient AI agent in her native Tagalog over the weekend. Early data suggests that some customers prefer speaking with an AI than with a human: imagine always getting the same customer service agent, with perfect memory of your every interaction every time you call.

Which brings us to the question: Why now? In 2024, a number of technical advances at the AI voice model layer paved the way for models to reach minimum thresholds in latency and emotion for wider adoption, leading to the emergence of exciting new AI applications. For example, ChatGPT Advanced Voice Mode added internet search, while Google launched Gemini 2.0 with native audio. This gave rise to a plethora of new voice agents and AI scribes across verticals.

Building on these core model improvements, several players have emerged that abstract away infrastructure requirements and provide tooling to get voice agents up and running quickly and reliably. Platforms like ElevenLab’s conversational AI, Vapi, Retell, and Bland all manage the orchestration of third-party speech-to-text (STT) models, LLMs, and text-to-speech (TTS) models. Yet, as startup and enterprise use cases reach scale, an increasing number of players are moving to their own infrastructure stacks across model orchestration (e.g., Pipecat), telephony (e.g., telynx), evaluations and testing (e.g., Coval), and custom databases for knowledge retrieval (e.g., Pinecone).

What we believe is required for voice agents to win in financial services

For voice agents in finance to become successful, they will need to navigate industry-specific nuances, such as:

  • Compliance and regulatory requirements: Like their human counterparts, voice agents in finance need to adhere to strict regulatory requirements, including limitations on investment advice. Due to the potential for financial crime and fraud, agents will also need to build in additional or multi-step customer verification flows, especially as voice cloning becomes easier.
  • Custom integrations and deployments: Teams building agents in finance will likely need to navigate on-prem deployments and access data from a range of legacy systems of record that require custom integrations.
  • Domain-specific knowledge: Agents will need to be trained on a corpus of industry-specific terms and jargon, and have breadth across a large range of financial products (e.g., checking, savings, investments, credit, and mortgages).

Ultimately, we believe voice AI is a powerful wedge in financial services to displace downstream legacy software in the form of additional workflows and building new AI-native systems of record. We have put together an initial view of emerging voice workflow opportunities in financial services below and will publish a more comprehensive deep dive in the future. If you're building in this space or should be included on our market map, we'd love to hear from you.


More From the a16z Fintech Team

How does a company as massive as Mastercard decide where to deploy AI? Instead of centralizing every AI initiative, Mastercard operates under a “hub and spoke” model — balancing centralized AI leadership with decentralized innovation across business units. In this episode of In the Vault, a16z Partner Marc Andrusko chats with Mastercard’s Chief AI and Data Officer Greg Ulrich about Mastercard’s long history of using AI, the opportunities (and potential risks) associated with integrating generative AI into fraud detection, determining what tech to employ based on use cases, and the best advice he’s ever gotten.

In this episode of AI + a16z, a16z Partner Alex Immerman sits down with Hebbia?Founder and CEO George Sivulka to discuss the potential for reasoning models and AI agents to supercharge knowledge-worker productivity — and the global economy along with it. As George explains, his customers are already saving significant time and effort on important, but monotonous, tasks, and improved models paired with savvy users will continue to reshape how industries including finance, law, and other professional services operate.

a16z General Partner David Haber spoke to Fortune on his journey from building financing startup Bond Street, which he sold to Goldman Sachs, to joining a16z and the New York startup ecosystem.

a16z portfolio company Sardine raised $70 million to make fraud and compliance teams more productive through AI agents. Also, a16z is doubling down on our investment in Camber and leading their Series B round of financing.


Recent M&A Deals and Market Intel

Monzo is reportedly holding preliminary discussions with bankers as it seeks to get “IPO ready” by the end of this year. Monzo’s CEO is advocating for a U.S. listing, while the company’s board is leaning towards a listing in the U.K.

Tabby is reportedly in talks with HSBC, JP Morgan, and Morgan Stanley for a public offering in Saudi Arabia in late 2025 or 2026.

HiBob announced its acquisition of Mosaic, a cloud-based FP&A platform, on February 13. The acquisition enables HiBob to offer a fully integrated solution that helps mid-market companies gain real-time insights into workforce productivity, employee engagement, and financial outcomes.

Rapyd is in talks to raise $300 million from investors at a $3.5 billion valuation. It intends to use the funding in part to acquire a payment processing startup.

Worldpay announced its acquisition of Ravelin, a next-gen fraud prediction and prevention solution for e-commerce merchants, on February 4. The acquisition complements and enhances Worldpay’s portfolio of value-added solutions. It is Worldpay’s first acquisition since spinning off from FIS last year.


Connect with a16z

  • Interested in receiving this newsletter on a monthly basis? Subscribe here.
  • Want more insights from a16z? Subscribe to our suite of newsletters here.


The views expressed here are those of the individual AH Capital Management, L.L.C. (“a16z”) personnel quoted and are not the views of a16z or its affiliates. Certain information contained in here has been obtained from third-party sources, including from portfolio companies of funds managed by a16z. While taken from sources believed to be reliable, a16z has not independently verified such information and makes no representations about the enduring accuracy of the information or its appropriateness for a given situation. In addition, this content may include third-party advertisements; a16z has not reviewed such advertisements and does not endorse any advertising content contained therein.

This content is provided for informational purposes only, and should not be relied upon as legal, business, investment, or tax advice. You should consult your own advisers as to those matters. References to any securities or digital assets are for illustrative purposes only, and do not constitute an investment recommendation or offer to provide investment advisory services. Furthermore, this content is not directed at nor intended for use by any investors or prospective investors, and may not under any circumstances be relied upon when making a decision to invest in any fund managed by a16z. (An offering to invest in an a16z fund will be made only by the private placement memorandum, subscription agreement, and other relevant documentation of any such fund and should be read in their entirety.) Any investments or portfolio companies mentioned, referred to, or described are not representative of all investments in vehicles managed by a16z, and there can be no assurance that the investments will be profitable or that other investments made in the future will have similar characteristics or results. A list of investments made by funds managed by Andreessen Horowitz (excluding investments for which the issuer has not provided permission for a16z to disclose publicly as well as unannounced investments in publicly traded digital assets) is available at https://a16z.com/investments/.

Charts and graphs provided within are for informational purposes solely and should not be relied upon when making any investment decision. Past performance is not indicative of future results. The content speaks only as of the date indicated. Any projections, estimates, forecasts, targets, prospects, and/or opinions expressed in these materials are subject to change without notice and may differ or be contrary to opinions expressed by others. Please see https://a16z.com/disclosures for additional important information.


Thank you for covering Customer Support - An important piece of the puzzle. Orin has over 50+ Fintech AI-Agents for Contact Center workflows.

David Douglas

AI-Driven Operations Leader: Specialising in Cloud IT, Advanced AI Systems at Scale, for Business Operations & Project Portfolio Delivery with compliance, milestones, risk and supplier performance management.

4 小时前

Voice AI is coming for every sector sooner than most think! We are already using Natural Language Processing (NLP), speech over the desktop for business operations application user guides, screen navigation and data sorting in a project portfolio management application. The principles no different for any software application...Fintech included, if the LLM is trained to remove hallucinations and the data is trust worthy.

回复
Ronnie Gosch

thejanesays.com

5 小时前

Daughter is a brain surgeon. Erin, Billionaire voice, Family practice Dr., Me, Internal Medicine- Me, Millionaire voice. Trillionaire.

回复
Ronnie Gosch

thejanesays.com

5 小时前

Nobody in the CBA said anything to me after the 13X over thing for 4 more years.

回复

要查看或添加评论,请登录

Andreessen Horowitz的更多文章