AI

BentoML scores $9M funding to expedite AI app development

Comment

Computer code on a curved screen.
Image Credits: monsitj / Getty Images

The success of large language models like GPT has sparked a frenzy of developers eager to make AI-powered applications. But building AI services can be tricky, especially due to the shortage of skilled developers to meet the rising demand these days.

That’s where Chaoyu Yang, an early software engineer at the data mega-unicorn Databricks, comes in. Along with his co-founders, he’s built the AI development framework BentoML, which just announced a seed financing round.

Yang, in an interview with TechCrunch, explained that today’s AI services are often built on multiple machine learning models, making their management and operation complicated. Many programmers entering the fray are coming from a full-stack or application development background, meaning they often lack the skills to build the required AI infrastructure, resulting in a prolonged development process.

A demo AI app like Microsoft’s Visual ChatGPT, an upgrade to the chatbot that allows it to produce responses from both text and image prompts, for example, can take at least three to six months to make it production-ready, Yang said.

While tech behemoths like Microsoft enjoy the financial prowess and human capital to train AI models and put them to use in the real world, smaller businesses, in Yang’s words, are collecting “valuable data that can benefit tremendously from AI” but “lack the resources to build the infrastructure for development.”

BentoML, which provides a high-level API that abstracts away the details of the infrastructure needed for running AI models on the cloud, belongs to a camp of tools like SageMaker that wants to smooth the path for developing AI services. It’s a so-called AI application framework, a set of tools that make it easier to build, ship and scale AI applications, like a construction tool kit one uses to build a house.

Specifically, BentoML is targeting data scientists who train AI models, DevOp engineers who manage their lifecycle and developers who actually build applications on top of the models.

With BentoML, developers can make Visual ChatGPT scalable and cost-efficient for production use in as short as two days, said Yang. Users have also used the framework to run the art generator Stable Diffusion and open source LLMs on the cloud.

Yang compared his company to Vercel, which focuses on serving front-end developers and was last valued at over $1 billion. BentoML aims to be the Vercel for AI, he said.

While Yang predicted that AI would eventually become more production-ready, he admitted he didn’t think the AI application wave would arrive so soon. The founder expects AI app developers to account for over 90% of the platform’s users in the future.

“If you ask me a year ago, I’d say that probably 90% of the companies would be training their own models, but the foundation models that have recently emerged are so powerful that they can perform well even given a dataset it has never seen before,” he said.

“Rather than focusing on model training, developers now only need to work on finetuning and product engineering, which in themselves present a bottleneck because of the shortage of AI-focused developers.”

BentoML was open sourced in 2019 and later introduced a self-hosted SaaS version to enterprise customers. It’s been acquiring users organically through its open source community, which quadrupled its membership to more than 3,000 over the past year, with Japanese messaging giant Line and South Korean internet conglomerate Naver being among its early adopters.

Yang declined to disclose the company’s revenue size.

Investors are taking note of BentoML’s traction in the developer community. The startup recently raised $9 million from its seed financing round led by DCM Ventures, with Bow Capital also participating. DCM’s general partner, Hurst Lin, has joined BentoML’s board following the round.

The exuberant AI market has been a boon to BentoML, but the rapidly changing industry also makes it tricky for the team to juggle short- and long-term goals, Yang admitted.

“You might have to build things that ride the current trend, but in the long term, we of course want to have our own moat. The question is how we balance our time and human resources between the two.”

Update (June 28, 2023): The article previously stated that Line was a South Korean company. Line is headquartered in Japan and is a subsidiary of Z Holdings, a joint venture between Japan’s SoftBank and South Korea’s Naver.

Vercel raises $150M Series D as it looks to build an end-to-end front-end development platform

More TechCrunch

Boeing’s Starliner spacecraft has successfully delivered two astronauts to the International Space Station, a key milestone in the aerospace giant’s quest to certify the capsule for regular crewed missions.  Starliner…

Boeing’s Starliner overcomes leaks and engine trouble to dock with ‘the big city in the sky’

Rivian needs to sell its new revamped vehicles at a profit in order to sustain itself long enough to get to the cheaper mass market R2 SUV on the road.

Rivian’s path to survival is now remarkably clear

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

2 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

In a research note, HSBC estimates that the Indian edtech giant Byju’s, once valued at $22 billion, is now worth nothing.

HSBC believes that $22 billion Byju’s is now worth zero

As WWDC 2024 nears, all sorts of rumors and leaks have emerged about what iOS 18 and its AI-powered apps and features have in store.

What to expect from Apple’s AI-powered iOS 18 at WWDC 2024

Apple’s annual list of what it considers the best and most innovative software available on its platform is turning its attention to the little guy.

Apple’s Design Awards highlight indies and startups

Meta launched its Meta Verified program today along with other features, such as the ability to call large businesses and custom messages.

Meta rolls out Meta Verified for WhatsApp Business users in Brazil, India, Indonesia and Colombia

Last year, during the Q3 2023 earnings call, Mark Zuckerberg talked about leveraging AI to have business accounts respond to customers for purchase and support queries. Today, Meta announced AI-powered…

Meta adds AI-powered features to WhatsApp Business app

TikTok is testing streaks that are similar to Snapchat’s in order to boost engagement, including how long people stay on the app.

TikTok is testing Snapchat-like streaks

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Your usual…

Inside Fisker’s collapse and robotaxis come to more US cities

New York-based Revel has made a lot of pivots since initially launching in 2018 as a dockless e-moped sharing service. The BlackRock-backed startup briefly stepped into the e-bike subscription business.…

Revel to lay off 1,000 staff ride-hail drivers, saying they’d rather be contractors anyway

Google says apps offering AI features will have to prevent the generation of restricted content.

Google Play cracks down on AI apps after circulation of apps for making deepfake nudes

The British retailers association also takes aim at Amazon’s “Buy Box,” claiming that Amazon manipulated which retailers were selected for the coveted placement.

UK retailers file a £1.1B collective action against Amazon over claims of data misuse

Featured Article

Rivian overhauled the R1S and R1T to entice new buyers ahead of cheaper R2 launch

Rivian has changed 600 parts on its R1S SUV and R1T pickup truck in a bid to drive down manufacturing costs, while improving performance of its flagship vehicles.  The end goal, which will play out over the coming year, is an existential one. Rivian lost about $38,784 on every vehicle…

5 hours ago
Rivian overhauled the R1S and R1T to entice new buyers ahead of cheaper R2 launch

Twitch has come up with a solution for the ongoing copyright issues that DJs encounter on the platform. The company announced Thursday a new program that enables DJs to stream…

Twitch DJs will now have to pay music labels to play songs in livestreams

Google said today it is partnering with RapidSOS, a platform for emergency first responders, to enable users to contact 911 through RCS (Rich Messaging Service).

Google partners with RapidSOS to enable 911 contact through RCS

Long before product-led growth became a buzzword, Atlassian offered free tiers for virtually all of its productivity and developer tools. Today, that mostly means free access for up to 10…

Atlassian now gives startups a year of free access

Featured Article

A social app for creatives, Cara grew from 40k to 650k users in a week because artists are fed up with Meta’s AI policies

Artists have finally had enough with Meta’s predatory AI policies, but Meta’s loss is Cara’s gain. An artist-run, anti-AI social platform, Cara has grown from 40,000 to 650,000 users within the last week, catapulting it to the top of the App Store charts. Instagram is a necessity for many artists,…

6 hours ago
A social app for creatives, Cara grew from 40k to 650k users in a week because artists are fed up with Meta’s AI policies

Google has developed a new AI tool to help marine biologists better understand coral reef ecosystems and their health, which can aid in conversation efforts. The tool, SurfPerch, created with…

Google looks to AI to help save the coral reefs

Only a few years ago, one of the hottest topics in enterprise software was ‘robotic process automation’ (RPA). It doesn’t feel like those services, which tried to automate a lot…

Tektonic AI raises $10M to build GenAI agents for automating business operations

SpaceX achieved a key milestone in its Starship flight test campaign: returning the booster and the upper stage back to Earth.

SpaceX launches mammoth Starship rocket and brings it back for the first time

There’s a lot of buzz about generative AI and what impact it might have on businesses. But look beyond the hype and high-profile deals like the one between OpenAI and…

Sirion, now valued around $1B, acquires Eigen as consolidation comes to enterprise AI tooling

Carlo Kobe and Scott Smith believed so strongly in the need for a debit card product designed specifically for Gen Zers that they dropped out of Harvard and Cornell at…

Kleiner Perkins leads $14.4M seed round into Fizz, a credit-building debit card aimed at Gen Z college students

A new app called MyGlimpact is intended not only to help people understand their environmental footprint, but why they shouldn’t feel guilty about it.

How many Earths does your lifestyle require?

Prolific Machines believes it has a way of transitioning away from molecules to something better: light.

Prolific Machines, with a $55M Series B, shines ‘light’ on a better way to grow lab proteins for food and medicine

It’s been 20 years since Shira Yevin, the lead singer of punk band Shiragirl drove a pink RV into the Vans Warped Tour grounds, the now-defunct punk rock festival notorious…

Punk singer Shira Yevin pushes for fair pay with InPink, a women-focused job marketplace

While the transport industry does use legacy software, many of these platforms are from an earlier era. Qargo hopes its newer technologies can help it leapfrog the competition.

Qargo raises $14M to digitize and decarbonize the trucking industry

When you look at how generative AI is being implemented across developer tools, the focus for the most part has been on generating code, as with Github Copilot. Greptile, an…

Greptile raises $4M to build an AI-fueled code base expert

The models tended to answer questions inconsistently, which reflects biases embedded in the data used to train the models.

Study finds that AI models hold opposing views on controversial topics

A growing number of businesses are embracing data models — abstract models that organize elements of data and standardize how they relate to one another. But as the data analytics…

Cube is building a ‘semantic layer’ for company data