Featured Article

That agent who sounds like they’re from Paris, Texas? Try Paris, France

A startup called Sanas has built voice AI to change accents — now it’s raised $32M from GV and others

Comment

VOIP headset on laptop computer keyboard.
Image Credits: lenets_Mikolay / Getty Images

Our daily lives are more linked into a globalized grid than ever before. Products are sourced and shipped from afar; traveling to a place 3,000 miles away can be easier than getting across a big city in traffic; and information disseminates to anyone and everyone at the tap of a finger.

A startup called Sanas has developed some AI voice technology that aims to make one critical component of that grid work more smoothly — how people speaking the same language but with different accents can understand each other better, by filtering accented voices and turning those accents in real time into other ones. Today the startup is announcing $32 million in funding on the heels of strong momentum for its tools as it comes out of stealth and launches more widely.

Insight Partners is leading the investment, with participation from new backers GV (formerly Google Ventures), strategic backer Assurant Ventures and angel investor Gokul Rajaram. Previous backers from its seed round Human Capital, General Catalyst, Quiet Capital and DN Capital are also participating in this Series A. Along with the investment, Sanas is also announcing a strategic partnership with Alorica, one of the largest BPOs in the world, which is rolling out the tech to 100,000 employees and 250 enterprise customers globally.

The company is not disclosing valuation but we understand that it is $150 million post-money. This Series A is one of the biggest for a voice AI startup, and from what we understand, it’s coming after Sanas turned down an acquisition offer from big tech. (If you can’t buy ’em, invest in ’em!)

As you might have surmised from its list of investors, Sanas’ technology is already being deployed in call centers. Specifically, it’s found a lot of traction with far-flung customer service providers, which have become a hotbed of abuse against agents who might speak the same language as a customer, but heavily accented.

In addition to insurance giant Assurant and BPO leviathan Alorica, other customers include the the large collection agency firm ERC and travel industry BPO IGT. In a sad comment on the state of our world, Sanas’s CEO and co-founder Maxim Serebryakov said the result of using the tech in these places has been dramatic in terms of the reduction in agent harassment.

Sanas’s plan is to use the funding both to continue expanding its business in that vertical and to start to shape up for other use cases in the enterprise — for example, as a plug-in for video calls or for voice-based interactive services to help machines (and ML-based systems) understand a wider range of accents.

Serebryakov initially co-founded the company with Shawn Zhang and Andrés Pérez Soderi, two fellow students at Stanford’s artificial intelligence lab, after a fourth friend of theirs needed to leave school and return to his native country, Nicaragua, to take a job to help with a family emergency.

The friend took a job at a call center back home serving customers in the U.S., and even though he was completely fluent — and a student taking a break from Stanford no less — he faced endless abuse over the phone from people who didn’t like his accent.

The three others understood this judgment, response and abuse all too well, being first-generation immigrants themselves (and I’ll add that I know this very well firsthand, too, both in my current life and growing up as a first-generation immigrant in the U.S.). And so they decided to put their AI learnings to the test to see if they could fix it. (Earlier this year, Sanas also picked up a fourth co-founder, Sharath Keshava Narayana, who is also now COO and who left another company he co-founded, Observe.AI, after learning about the company and wanting to be involved in building it.)

There are tons of tools out there today to “autotune” and modify a person’s voice in real time or delayed time — they are about as common as photo filters at this point. But as Serebryakov notes, it’s especially tricky to be able to preserve the natural, actual voice and change how it is saying what it is saying.

Interestingly, the problem is one that is so abstracted — Sanas has approached fixing it by ingesting thousands of hours of differently accented speech into a system and ordering it to match with other sounds, with the whole mix of technology and method now also in a patent process — that the end result is that Sanas’s accent “translation” engine can be used with any language at all, not just English as you might have assumed. (Serebryakov tells me it’s being used already to “smooth out” accents across Japan, China and South Korea, for example.)

“Technology like this is applicable globally, from one accent to another,” he said. “It will take time, but our goal is to let people communicate in any accent at all.”

There is a certain unease around the very concept of what Sanas is tackling and doing here. It raises a lot of questions of potential abuse, and apart from that, some might find it distasteful and passé for technology to be developed specifically to obscure a person’s true identity: Shouldn’t the people who are judging over accents learn to be more open-minded and accepting, rather than people forever accommodating prejudices by hiding anything that marks others as outsiders or different?

There are points that argue against these, too, though. Sanas is specifically not building any applications for consumers or making its tech accessible to them at the moment precisely because of how that might get misused. Even its customers are not using a cloud-based version of the tech: To keep things extra-secure, it sits on premises and so customers control their own data that passes and is generated through Sanas.

On the part of obscuring true identity, that is definitely a bigger issue that we all need to tackle on a daily basis. In the meantime, this is giving those who are at the sharper end of those jabs a way of coping better, and in some very practical ways making it easier for people (even those who are well-intentioned) simply to understand each other without accents getting in the way.

I had a demo of the service during my interview, where Sanas rang up one of their customers’ agents in India and got him to chat with me first with his own accent, and then he “turned on” his Midwestern-neutral tone. It was a little creepy knowing what was happening in the background, but on the surface, I was very surprised at how natural it all seemed — well, natural enough, at least. His voice was clear, but maybe a little too clear, and almost a tiny bit robotic and emotion-free.

Apparently, that, too, is somewhat intentional for now and might evolve if that’s what customers and other users want.

“The reason we are focused on call centers is that it’s a low hanging fruit,” Serebryakov said, noting that the difficulties of building what is effectively groundbreaking tech was hard enough, but it also happens to fit the use case. “For us when building this, it was important to go down the path of least resistance. No singing, no laughter, no hyper-emotive speech. What we are tackling is we’re trying to give control over how these users interact at work.” There’s no crying in baseball, and there’s no fun and games in the call center, either.

“Insight Partners is thrilled to deepen its relationship with Sanas on such cutting-edge and transformative technology,” said Ganesh Bell, an MD at Insight, in a statement. “As the company comes out of its stealth phase, I look forward to working with this immensely talented and passionate team to build a product that will, among many things, help eliminate the unfortunate biases and discrimination experienced by those who speak English as a second language, which includes many of Sanas’ employees.”

More TechCrunch

Apple is set to board the runaway locomotive that is generative AI at next week’s World Wide Developer Conference. Reports thus far have pointed to a partnership with OpenAI that…

Apple’s generative AI offering might not work with the standard iPhone 15

LinkedIn has confirmed it will no longer allow advertisers to target users based on data gleaned from their participation in LinkedIn Groups. The move comes more than three months after…

LinkedIn to limit targeted ads in EU after complaint over sensitive data use

Founders: Need plans this weekend? What better way to spend your time than applying to this year’s Startup Battlefield 200 at TechCrunch Disrupt. With Monday’s deadline looming, this is a…

Startup Battlefield 200 applications due Monday

The company is in the process of building a gigawatt-scale factory in Kentucky to produce its nickel-hydrogen batteries.

Novel battery manufacturer EnerVenue is raising $515M, per filing

Meta is quietly rolling out a new “Communities” feature on Messenger, the company confirmed to TechCrunch. The feature is designed to help organizations, schools and other private groups communicate in…

Meta quietly rolls out Communities on Messenger

Featured Article

Siri and Google Assistant look to generative AI for a new lease on life

Voice assistants in general are having an existential moment, and generative AI is poised to be the logical successor.

5 hours ago
Siri and Google Assistant look to generative AI for a new lease on life

Education software provider PowerSchool is being taken private by investment firm Bain Capital in a $5.6 billion deal.

Bain to take K-12 education software provider PowerSchool private in $5.6B deal

Shopify has acquired Threads.com, the Sequoia-backed Slack alternative, Threads said on its website. The companies didn’t disclose the terms of the deal but said that the Threads.com team will join…

Shopify acquires Threads (no, not that one)

Featured Article

Bangladeshi police agents accused of selling citizens’ personal information on Telegram

Two senior police officials in Bangladesh are accused of collecting and selling citizens’ personal information to criminals on Telegram.

16 hours ago
Bangladeshi police agents accused of selling citizens’ personal information on Telegram

Carta, a once-high-flying Silicon Valley startup that loudly backed away from one of its businesses earlier this year, is working on a secondary sale that would value the company at…

Carta’s valuation to be cut by $6.5 billion in upcoming secondary sale

Boeing’s Starliner spacecraft has successfully delivered two astronauts to the International Space Station, a key milestone in the aerospace giant’s quest to certify the capsule for regular crewed missions.  Starliner…

Boeing’s Starliner overcomes leaks and engine trouble to dock with ‘the big city in the sky’

Rivian needs to sell its new revamped vehicles at a profit in order to sustain itself long enough to get to the cheaper mass market R2 SUV on the road.

Rivian’s path to survival is now remarkably clear

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

22 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

HSBC and BlackRock estimate that the Indian edtech giant Byju’s, once valued at $22 billion, is now worth nothing.

HSBC believes that $22 billion Byju’s is now worth zero

As WWDC 2024 nears, all sorts of rumors and leaks have emerged about what iOS 18 and its AI-powered apps and features have in store.

What to expect from Apple’s AI-powered iOS 18 at WWDC 2024

Apple’s annual list of what it considers the best and most innovative software available on its platform is turning its attention to the little guy.

Apple’s Design Awards highlight indies and startups

Meta launched its Meta Verified program today along with other features, such as the ability to call large businesses and custom messages.

Meta rolls out Meta Verified for WhatsApp Business users in Brazil, India, Indonesia and Colombia

Last year, during the Q3 2023 earnings call, Mark Zuckerberg talked about leveraging AI to have business accounts respond to customers for purchase and support queries. Today, Meta announced AI-powered…

Meta adds AI-powered features to WhatsApp Business app

TikTok is testing streaks that are similar to Snapchat’s in order to boost engagement, including how long people stay on the app.

TikTok is testing Snapchat-like streaks

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Your usual…

Inside Fisker’s collapse and robotaxis come to more US cities

New York-based Revel has made a lot of pivots since initially launching in 2018 as a dockless e-moped sharing service. The BlackRock-backed startup briefly stepped into the e-bike subscription business.…

Revel to lay off 1,000 staff ride-hail drivers, saying they’d rather be contractors anyway

Google says apps offering AI features will have to prevent the generation of restricted content.

Google Play cracks down on AI apps after circulation of apps for making deepfake nudes

The British retailers association also takes aim at Amazon’s “Buy Box,” claiming that Amazon manipulated which retailers were selected for the coveted placement.

Amazon slammed with £1.1B data abuse lawsuit from UK retailers

Featured Article

Rivian overhauled the R1S and R1T to entice new buyers ahead of cheaper R2 launch

Rivian has changed 600 parts on its R1S SUV and R1T pickup truck in a bid to drive down manufacturing costs, while improving performance of its flagship vehicles.  The end goal, which will play out over the coming year, is an existential one. Rivian lost about $38,784 on every vehicle…

1 day ago
Rivian overhauled the R1S and R1T to entice new buyers ahead of cheaper R2 launch

Twitch has come up with a solution for the ongoing copyright issues that DJs encounter on the platform. The company announced Thursday a new program that enables DJs to stream…

Twitch DJs will now have to pay music labels to play songs in livestreams

Google said today it is partnering with RapidSOS, a platform for emergency first responders, to enable users to contact 911 through RCS (Rich Messaging Service).

Google partners with RapidSOS to enable 911 contact through RCS

Long before product-led growth became a buzzword, Atlassian offered free tiers for virtually all of its productivity and developer tools. Today, that mostly means free access for up to 10…

Atlassian now gives startups a year of free access

Featured Article

A social app for creatives, Cara grew from 40k to 650k users in a week because artists are fed up with Meta’s AI policies

Artists have finally had enough with Meta’s predatory AI policies, but Meta’s loss is Cara’s gain. An artist-run, anti-AI social platform, Cara has grown from 40,000 to 650,000 users within the last week, catapulting it to the top of the App Store charts. Instagram is a necessity for many artists,…

1 day ago
A social app for creatives, Cara grew from 40k to 650k users in a week because artists are fed up with Meta’s AI policies

Google has developed a new AI tool to help marine biologists better understand coral reef ecosystems and their health, which can aid in conversation efforts. The tool, SurfPerch, created with…

Google looks to AI to help save the coral reefs

Only a few years ago, one of the hottest topics in enterprise software was ‘robotic process automation’ (RPA). It doesn’t feel like those services, which tried to automate a lot…

Tektonic AI raises $10M to build GenAI agents for automating business operations