AI

Salesforce is betting that its own content can bring more trust to generative AI

Comment

Illustration of two business people analyzing charts and graphs.
Image Credits: z_wei / Getty Images

It has become apparent in recent weeks that generative AI has the potential to transform how we interact with software, allowing us to describe what we want instead of clicking or tapping. That shift could have a profound impact on enterprise software. At the Salesforce World Tour NYC event last week, that vision was on full display.

Consider that during the 67-minute main keynote, it took less than five minutes for Salesforce CMO Sarah Franklin to introduce the subject of ChatGPT. The company then spent the next 40 minutes and several speakers talking about generative AI and the impact it would have across the entire platform. The final speaker talked about Data Cloud, an adjacent technology. It’s fair to say that other than a few minutes of introduction, it was all the company talked about.

That included discussions of EinsteinGPT, a tool for asking questions about Salesforce content, and SlackGPT, a tool for asking Slack questions about its content. In addition, the company talked about the ability to create landing pages on the fly, write sales emails (if that’s what you want) and write Apex code (Salesforce’s programming language) to programmatically trigger certain actions in a workflow, among other things.

When you think about the fact that generative AI wasn’t even really a thing people were talking about until OpenAI released ChatGPT at the end of last year, and events like this take months of planning, the company probably had to switch gears recently to focus its presentation so completely on this single subject.

Salesforce isn’t alone in its new focus on applying generative AI to its existing products and services. Over the past several months, we’ve seen many enterprise software companies announce plans to incorporate this technology into their stacks, even if overall most of these new tools are still a work in progress.

Just last week we had announcements from Zoho, Box and ServiceNow, while other companies too numerous to mention individually have made similar announcements in recent months.

A year after we saw the crypto and metaverse hype machines come crashing down, it’s fair to ask if these companies are moving too fast, chasing the next big shiny thing without considering some of the technology’s limitations, especially its well-documented hallucination problem. For this post, we are going to concentrate on Salesforce’s view of things and how it hopes to overcome some of those known issues when it comes to incorporating generative AI onto the platform.

Got 99 problems, but data ain’t one

Perhaps it’s unfair to put generative AI in the same category as other hyped technologies because we are only now seeing the direct impact of this approach. It took decades of research, development and technological shifts to get us to this point, said Juan Perez, Salesforce’s CIO, who is in charge of the company’s technology strategies.

“This is different, actually. First of all, it’s more real, and AI is not new. We’ve had decades and decades of advancement in AI,” Perez said. And he pointed out that it’s not new for Salesforce, either. It introduced its AI layer, Einstein, back in 2016, and has been refining it ever since.

Perez told TechCrunch+ that he actually uses Einstein AI to help generate reports to do his work, and the developments we are seeing with generative AI will only make the process easier. “With the advances of generative AI, with the compute power, the large-scale systems that can support these large language models, the game is entirely different,” he said.

One theme that Salesforce kept coming back to at the event was the notion of trust and that building AI solutions on top of Salesforce data could help develop more trusted AI. A more trustworthy underlying dataset could in turn help limit hallucination issues where the AI doesn’t actually know with certainty what the response should be and essentially makes one up.

But the company is working hard to make sure that the AI is giving the best answers possible with the understanding that nobody can guarantee that the generative AI won’t hallucinate answers at this point, according to Silvio Savarese, the company’s EVP and chief scientist.

“Good quality data is key for generating good quality outputs.Training or fine-tuning models using curated high-quality CRM data allows you to build trusted generative capabilities. However, even with high-quality data, LLMs can still generate hallucinations,” he said. It’s important to understand that as you implement the technology at your company.

Salesforce is working to mitigate the problem on several fronts, he said. By building its own models, the company can control for some factors that can cause the model to hallucinate. “We have full control of the learning procedure … can inject additional labeling/instruction capabilities and embed constitutional AI methods to mitigate hallucinations,” he said.

In addition, training can be ongoing rather than training once and deploying, as is sometimes the case with LLMs today, he said. “This is especially vital in the world of CRM, where data is constantly changing and freshness is mission critical. By keeping LLMs trained on the most up-to-date information, a common source of mistakes can be minimized.” It’s worth noting, however, that as customers build or bring their own LLMs, Salesforce will still supply the data but have less control over how it gets incorporated, managed and used in external models.

A matter of trust

By using a more constrained set of data for the LLMs that comes from a source like Salesforce, the company is operating on the theory that it will limit the hallucination problem. Vishal Sikka, CEO and founder at Vianai Systems, an MLOps startup told TechCrunch+ in a recent interview that it’s imperative to solve the hallucination issue before it can be used in mission-critical applications in enterprise settings.

“The first part is the safety issue because in the current state of the art, the scientists who have built this transformer technology don’t know how to make it produce good answers and not produce bad ones. They don’t know if it is even possible that it can be done,” he said.

That means that if you have a problem that requires a precise answer, you need total certainty, and we don’t have that yet.

But Ray Wang, founder and principal analyst at Constellation Research, told TechCrunch+ that there are business cases where you don’t need total accuracy to be useful.

“Generative AI ultimately requires massive amounts of data for high precision,” he said. “This requires removing false positives and false negatives with training and human augmentation. Areas where we need 100% accuracy will be hard to achieve, but if we can live with 70% or 80% accuracy, many tasks such as self-service customer care, or sales lead scoring, or campaign automation will become easier.”

Brent Hayward, CEO at Salesforce subsidiary Mulesoft, thinks that putting humans, who understand the data in the process could help tell the model when it’s right and when it’s not, what he calls “tuning for true.” That could help correct the AI when wrong and help improve models along the way.

“If the generative AI is helping create a workflow and generating code to help, the source of that code really matters,” Hayward said. “If the dataset we’ve trained the model on is all of our API’s, you can say the trust is quite high.”

He sees possibly developing a trust score based on where the data is coming from, and how much we can rely on the answers from a given set of data, an approach he thinks will be increasingly important.

People in fact remain a key part of Salesforce’s AI vision, Savarese said. “By enabling human-in-the-loop capabilities, users can verify the quality of the output of generative AI and intervene to fix hallucinations or other factual errors. This is both a powerful safety feature and an example of our core value at Salesforce AI, which is augmenting human talent rather than attempting to replace it,” he said.

Perez anticipates that part of his job, and that of all CIOs moving forward, will be ensuring that the company’s LLMs are using trusted data. “Remember the evolution of the CIO in the areas of security and privacy. We have had to really take a much stronger stance as CIOs to ensure that security is a priority, that privacy is priority. Well, now with generative AI, I think CIOs are going to have to also be like the guards of the castle and will have to ensure that there’s trusted data in support of AI,” he said.

It’s more than hallucinations

The hallucination issue is just one of the problems associated with generative AI. Another issue will be making sure that the generative AI doesn’t supply confidential company information or other sensitive data to people who aren’t supposed to see it.

Patrick Stokes, EVP and GM of platform at Salesforce, thinks that there will be limits put on what types of data can be put in the models to prevent this from happening. “Businesses and organizations like Salesforce are going to have to start to figure out what some of those swim lanes look like,” he said.

In practice that would mean looking at hiding certain fields from the model if it includes data you didn’t want unauthorized people seeing, but that’s still something that companies like Salesforce need to work out.

There’s also the issue of data ownership. For example, if you are creating a landing page on the fly, do you have permission to use the photos on that landing page (or the source of generated images)? These kinds of legal issues could slow enterprise enthusiasm for generative AI until there are clearer answers.

It’s going to be imperative to solve all of these problems, and others that are sure to arise, as we insert generative AI into more of our software. But of all the issues, limiting hallucinations is going to be paramount because everyone using the generative AI capabilities in Salesforce (and all enterprise software) is going to need to trust that the answers they are getting from the system are true and accurate and not putting the company at risk.

Salesforce is making a big bet that using its own data in LLMs will be the key to doing this. Time will tell if this is right, or at least, if it can help limit the problem.

More TechCrunch

China has closed a third state-backed investment fund to bolster its semiconductor industry and reduce reliance on other nations, both for using and for manufacturing wafers — prioritizing what is…

China’s $47B semiconductor fund puts chip sovereignty front and center

Apple’s annual list of what it considers the best and most innovative software available on its platform is turning its attention to the little guy.

Apple’s Design Awards nominees highlight indies and startups, largely ignore AI (except for Arc)

The spyware maker’s founder, Bryan Fleming, said pcTattletale is “out of business and completely done,” following a data breach.

Spyware maker pcTattletale shutters after data breach

AI models are always surprising us, not just in what they can do, but what they can’t, and why. An interesting new behavior is both superficial and revealing about these…

AI models have favorite numbers, because they think they’re people

On Friday, Pal Kovacs was listening to the long-awaited new album from rock and metal giants Bring Me The Horizon when he noticed a strange sound at the end of…

Rock band’s hidden hacking-themed website gets hacked

Jan Leike, a leading AI researcher who earlier this month resigned from OpenAI before publicly criticizing the company’s approach to AI safety, has joined OpenAI rival Anthropic to lead a…

Anthropic hires former OpenAI safety lead to head up new team

Welcome to TechCrunch Fintech! This week, we’re looking at the long-term implications of Synapse’s bankruptcy on the fintech sector, Majority’s impressive ARR milestone, and more!  To get a roundup of…

The demise of BaaS fintech Synapse could derail the funding prospects for other startups in the space

YouTube’s free Playables don’t directly challenge the app store model or break Apple’s rules. However, they do compete with the App Store’s free games.

YouTube’s free games catalog ‘Playables’ rolls out to all users

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the first months of 2024. Smaller-sized…

8 hours ago
A comprehensive list of 2024 tech layoffs

OpenAI has formed a new committee to oversee “critical” safety and security decisions related to the company’s projects and operations. But, in a move that’s sure to raise the ire…

OpenAI’s new safety committee is made up of all insiders

Time is running out for tech enthusiasts and entrepreneurs to secure their early-bird tickets for TechCrunch Disrupt 2024! With only four days left until the May 31 deadline, now is…

Early bird gets the savings — 4 days left for Disrupt sale

AI may not be up to the task of replacing Google Search just yet, but it can be useful in more specific contexts — including handling the drudgery that comes…

Skej’s AI meeting scheduling assistant works like adding an EA to your email

Faircado has built a browser extension that suggests pre-owned alternatives for ecommerce listings.

Faircado raises $3M to nudge people to buy pre-owned goods

Tumblr, the blogging site acquired twice, is launching its “Communities” feature in open beta, the Tumblr Labs division has announced. The feature offers a dedicated space for users to connect…

Tumblr launches its semi-private Communities in open beta

Remittances from workers in the U.S. to their families and friends in Latin America amounted to $155 billion in 2023. With such a huge opportunity, banks, money transfer companies, retailers,…

Félix Pago raises $15.5 million to help Latino workers send money home via WhatsApp

Google said today it’s adding new AI-powered features such as a writing assistant and a wallpaper creator and providing easy access to Gemini chatbot to its Chromebook Plus line of…

Google adds AI-powered features to Chromebook

The dynamic duo behind the Grammy Award–winning music group the Chainsmokers, Alex Pall and Drew Taggart, are set to bring their entrepreneurial expertise to TechCrunch Disrupt 2024. Known for their…

The Chainsmokers light up Disrupt 2024

The deal will give LumApps a big nest egg to make acquisitions and scale its business.

LumApps, the French ‘intranet super app,’ sells majority stake to Bridgepoint in a $650M deal

Featured Article

More neobanks are becoming mobile networks — and Nubank wants a piece of the action

Nubank is taking its first tentative steps into the mobile network realm, as the NYSE-traded Brazilian neobank rolls out an eSIM (embedded SIM) service for travelers. The service will give customers access to 10GB of free roaming internet in more than 40 countries without having to switch out their own existing physical SIM card or…

15 hours ago
More neobanks are becoming mobile networks — and Nubank wants a piece of the action

Infra.Market, an Indian startup that helps construction and real estate firms procure materials, has raised $50M from MARS Unicorn Fund.

MARS doubles down on India’s Infra.Market with new $50M investment

Small operations can lose customers by not offering financing, something the Berlin-based startup wants to change.

Cloover wants to speed solar adoption by helping installers finance new sales

India’s Adani Group is in discussions to venture into digital payments and e-commerce, according to a report.

Adani looks to battle Reliance, Walmart in India’s e-commerce, payments race, report says

Ledger, a French startup mostly known for its secure crypto hardware wallets, has started shipping new wallets nearly 18 months after announcing the latest Ledger Stax devices. The updated wallet…

Ledger starts shipping its high-end hardware crypto wallet

A data protection taskforce that’s spent over a year considering how the European Union’s data protection rulebook applies to OpenAI’s viral chatbot, ChatGPT, reported preliminary conclusions Friday. The top-line takeaway…

EU’s ChatGPT taskforce offers first look at detangling the AI chatbot’s privacy compliance

Here’s a shoutout to LatAm early-stage startup founders! We want YOU to apply for the Startup Battlefield 200 at TechCrunch Disrupt 2024. But you’d better hurry — time is running…

LatAm startups: Apply to Startup Battlefield 200

The countdown to early-bird savings for TechCrunch Disrupt, taking place October 28–30 in San Francisco, continues. You have just five days left to save up to $800 on the price…

5 days left to get your early-bird Disrupt passes

Venture investment into Spanish startups also held up quite well, with €2.2 billion raised across some 850 funding rounds.

Spanish startups reached €100 billion in aggregate value last year

Featured Article

Onyx Motorbikes was in trouble — and then its 37-year-old owner died

James Khatiblou, the owner and CEO of Onyx Motorbikes, was watching his e-bike startup fall apart.  Onyx was being evicted from its warehouse in El Segundo, near Los Angeles. The company’s unpaid bills were stacking up. Its chief operating officer had abruptly resigned. A shipment of around 100 CTY2 dirt bikes from Chinese supplier Suzhou…

1 day ago
Onyx Motorbikes was in trouble — and then its 37-year-old owner died

Featured Article

Iyo thinks its GenAI earbuds can succeed where Humane and Rabbit stumbled

Iyo represents a third form factor in the push to deliver standalone generative AI devices: Bluetooth earbuds.

1 day ago
Iyo thinks its GenAI earbuds can succeed where Humane and Rabbit stumbled

Arati Prabhakar, profiled as part of TechCrunch’s Women in AI series, is director of the White House Office of Science and Technology Policy.

Women in AI: Arati Prabhakar thinks it’s crucial to get AI ‘right’