Featured Article

Valued at $1B, Kai-Fu Lee’s LLM startup unveils open source model

The venture capitalist and computer scientist wants to create an OpenAI equivalent for China

Comment

Image Credits: TechCrunch

Kai-Fu Lee, the computer scientist known in the West for his bestseller “AI Superpowers” and in China for his bets on artificial intelligence unicorns, has a new venture — and a great ambition.

In late March, Lee launched a company called 01.AI with the vision to develop a homegrown large language model for the Chinese market. The venture puts him in competition with other prominent Chinese tech leaders, including Sogou’s founder Wang Xiaochuan, who have been swiftly gathering talent and venture capital to establish China’s equivalents of OpenAI.

“I think necessity is the mother of innovation, and there’s clearly a huge necessity in China,” Lee, who’s 61 and leading 01.AI as CEO, told TechCrunch in an interview, explaining the motive behind starting the company. “Unlike the rest of the world, China doesn’t have access to OpenAI and Google because those two companies did not make their products available in China, so I think many doing LLM are trying to do their part in creating a solution for a market that really needs this.”

01.AI’s growth is a fitting reflection of the rapid development in the generative AI field. Seven months after its founding, the startup has released its first model, the open source Yi-34B. The decision to introduce an open LLM as its debut product is a way to “give back” to society, said Lee. For people who have felt LLaMA is a “godsend” to them, “we’ve provided a compelling alternative,” he added.

As of writing, Yi-34B, which is a bilingual (English and Chinese) base model trained with 34 billion parameters and significantly smaller than other open models like Falcon-180B and Meta LlaMa2-70B, came in first amongst pre-trained LLM models, according to a ranking by Hugging Face.

“We still believe that larger models, when trained well, on a large amount of high-quality data, will always outperform substantially smaller models of comparable quality and comparable technology, so I think [Yi-34B] outperforming much larger models is something that we don’t usually see,” said Lee. “We feel quite confident as we released models that are 100 billion to 400 billion over the next coming year, year and a half, these models will be dramatically better than today’s model that we announced.”

The startup’s ability to commence model training quickly is no doubt an outcome of its smooth fundraising, which is critical to securing top-tier talent and AI processors. While declining to disclose how much 01.AI has raised, Lee said it’s valued at $1 billion after receiving financing from Sinovation Ventures, Alibaba Cloud and other undisclosed investors.

01.AI has already grown to more than 100 employees, over half of whom are LLM experts from major multinational and Chinese tech firms. Its vice president of technology, for instance, is an early member of Google’s Bard, and its chief architect was a founding member of TensorFlow and worked alongside renowned researchers like Jeff Dean and Samy Bengio at Google Brain. The key figures behind Yi-34B are Wenhao Huang, a Microsoft Research Asia veteran, and Ethan Dai, who held senior AI positions at Huawei and Alibaba.

Having backed over 10 unicorns and venture-built seven companies through Sinovation Ventures, Lee is possibly one of the most well-connected investors and entrepreneurs in China.

“It’s been, you know, over 25 years since the founding of Microsoft Research Asia, and everything I’ve done has been about getting super great talent,” said Lee, who launched Microsoft Research Asia, the U.S. giant’s biggest research center abroad, before spearheading Google China. Over the years, Microsoft Research Asia has earned the reputation as the “West Point” for nurturing China’s AI entrepreneurs.

“Now, of course, you want to pay people fairly, and you need to be competitive in pay, but I really think that it’s also about people believing they can make a difference and believing the company can succeed,” said Lee, appearing in a video call at 9:30 p.m. Beijing time. His staff were just as self-motivating. One of the startup’s infrastructure experts was working well into the wee hours that day, still messaging Lee at 2:15 a.m. to express his excitement about being part of 01.AI’s mission.

It’s no secret that building LLMs is a costly undertaking. To sustain its cash-intensive operations, 01.AI has plans for monetization right from the start. While the company will continue to open source some of its models, its objective is to build a state-of-the-art proprietary model that serves as a foundation for a diverse range of commercial products.

“We can’t open source everything,” said Lee. “We were quite cognizant of the fact that these large language models require a lot of compute, and therefore, are very expensive. When we raise a lot of money, most of it will be spent on the GPU. Given that, we needed to first acquire as much GPU as we could, which we did.”

Like other LLM players in China, 01.AI has proactively stockpiled GPUs in anticipation of U.S. sanctions; it borrowed money to buy processors even before it landed funding. Over the past year, the Biden administration has heightened restrictions on China’s access to high-end AI chips, prompting Chinese firms to pay inflated prices for chips. The foresight was rewarded — 01.AI now has a supply that will suffice for at least the next 12-18 months.

Aside from causing headaches for Chinese firms, U.S. sanctions have been a catalyst for innovation by encouraging them to optimize the use of computing power. “With a very high-quality infrastructure team, for every 1,000 GPUs, we might be able to squeeze 2,000 GPUs workload out of them,” said Lee.

01.AI’s path to monetization hinges largely on its ability to find product-market fit for its expensive AI models. While top-notch LLM scientists are scarce, there’s no shortage of product talent in China.

“China’s not ahead of the U.S. in LLM, but there’s no doubt China can build better applications than American developers mostly because of the phenomenal mobile internet ecosystem that was built over the last 12 years or so,” argued Lee.

While the founder gave no details on the services in the pipeline, he hinted that the company is experimenting with concepts in the productivity and social directions, and he’d be “disappointed” if 01.AI didn’t release an app within this calendar year.

The startup’s ultimate goal, according to Lee, is to become an ecosystem where outside developers can build applications easily. “The duty is not just to push out good research models, but even more importantly to make application development easy so that there can be compelling applications,” he said. “At the end of the day. It is an ecosystem play.” Time will tell if Lee’s AI endeavor will pay off.

China’s search engine pioneer unveils open source large language model to rival OpenAI

Meituan buys founder’s months-old ‘OpenAI for China’ for $234M

More TechCrunch

The keynote will be focused on Apple’s software offerings and the developers that power them, including the latest versions of iOS, iPadOS, macOS, tvOS, visionOS and watchOS.

Watch Apple kick off WWDC 2024 right here

As WWDC 2024 nears, all sorts of rumors and leaks have emerged about what iOS 18 and its AI-powered apps and features have in store.

What to expect from Apple’s AI-powered iOS 18 at WWDC 2024

Welcome to Elon Musk’s X. The social network formerly known as Twitter where the rules are made up and the check marks don’t matter. Or do they? The Tesla and…

Elon Musk’s X: A complete timeline of what Twitter has become

TechCrunch has kept readers informed regarding Fearless Fund’s courtroom battle to provide business grants to Black women. Today, we are happy to announce that Fearless Fund CEO and co-founder Arian…

Fearless Fund’s Arian Simone coming to Disrupt 2024

Bridgy Fed is one of the efforts aimed at connecting the fediverse with the web, Bluesky and, perhaps later, other networks like Nostr.

Bluesky and Mastodon users can now talk to each other with Bridgy Fed

Zoox, Amazon’s self-driving unit, is bringing its autonomous vehicles to more cities.  The self-driving technology company announced Wednesday plans to begin testing in Austin and Miami this summer. The two…

Zoox to test self-driving cars in Austin and Miami 

Called Stable Audio Open, the generative model takes a text description and outputs a recording up to 47 seconds in length.

Stability AI releases a sound generator

It’s not just instant-delivery startups that are struggling. Oda, the Norway-based online supermarket delivery startup, has confirmed layoffs of 150 jobs as it drastically scales back its expansion ambitions to…

SoftBank-backed grocery startup Oda lays off 150, resets focus on Norway and Sweden

Newsletter platform Substack is introducing the ability for writers to send videos to their subscribers via Chat, its private community feature, the company announced on Wednesday. The rollout of video…

Substack brings video to its Chat feature

Hiya, folks, and welcome to TechCrunch’s inaugural AI newsletter. It’s truly a thrill to type those words — this one’s been long in the making, and we’re excited to finally…

This Week in AI: Ex-OpenAI staff call for safety and transparency

Ms. Rachel isn’t a household name, but if you spend a lot of time with toddlers, she might as well be a rockstar. She’s like Steve from Blues Clues for…

Cameo fumbles on Ms. Rachel fundraiser as fans receive credits instead of videos  

Cartwheel helps animators go from zero to basic movement, so creating a scene or character with elementary motions like taking a step, swatting a fly or sitting down is easier.

Cartwheel generates 3D animations from scratch to power up creators

The new tool, which is set to arrive in Wix’s app builder tool this week, guides users through a chatbot-like interface to understand the goals, intent and aesthetic of their…

Wix’s new tool taps AI to generate smartphone apps

ClickUp Knowledge Management combines a new wiki-like editor and with a new AI system that can also bring in data from Google Drive, Dropbox, Confluence, Figma and other sources.

ClickUp wants to take on Notion and Confluence with its new AI-based Knowledge Base

New York City, home to over 60,000 gig delivery workers, has been cracking down on cheap, uncertified e-bikes that have resulted in battery fires across the city.  Some e-bike providers…

Whizz wants to own the delivery e-bike subscription space, starting with NYC

This is the last major step before Starliner can be certified as an operational crew system, and the first Starliner mission is expected to launch in 2025. 

Boeing’s Starliner astronaut capsule is en route to the ISS 

TechCrunch Disrupt 2024 in San Francisco is the must-attend event for startup founders aiming to make their mark in the tech world. This year, founders have three exciting ways to…

Three ways founders can shine at TechCrunch Disrupt 2024

Google’s newest startup program, announced on Wednesday, aims to bring AI technology to the public sector. The newly launched “Google for Startups AI Academy: American Infrastructure” will offer participants hands-on…

Google’s new startup program focuses on bringing AI to public infrastructure

eBay’s newest AI feature allows sellers to replace image backgrounds with AI-generated backdrops. The tool is now available for iOS users in the U.S., U.K., and Germany. It’ll gradually roll…

eBay debuts AI-powered background tool to enhance product images

If you’re anything like me, you’ve tried every to-do list app and productivity system, only to find yourself giving up sooner than later because sooner than later, managing your productivity…

Hoop uses AI to automatically manage your to-do list

Asana is using its work graph to train LLMs with the goal of creating AI assistants that work alongside human employees in company workflows.

Asana introduces ‘AI teammates’ designed to work alongside human employees

Taloflow, an early stage startup changing the way companies evaluate and select software, has raised $1.3M in a seed round.

Taloflow puts AI to work on software vendor selection to reduce costs and save time

The startup is hoping its durable filters can make metals refining and battery recycling more efficient, too.

SiTration uses silicon wafers to reclaim critical minerals from mining waste

Spun out of Bosch, Dive wants to change how manufacturers use computer simulations by both using modern mathematical approaches and cloud computing.

Dive goes cloud-native for its computational fluid dynamics simulation service

The tension between incumbents and fintechs has existed for decades. But every once in a while, the two groups decide to put their competition aside and work together. In an…

When foes become friends: Capital One partners with fintech giants Stripe, Adyen to prevent fraud

After growing 500% year-over-year in the past year, Understory is now launching a product focused on the renewable energy sector.

Insurance provider Understory gets into renewable energy following $15M Series A

Ashkenazi will start her new role at Google’s parent company on July 31, after 23 years at Eli Lilly.

Alphabet brings on Eli Lilly’s Anat Ashkenazi as CFO

Tobiko aims to reimagine how teams work with data by offering a dbt-compatible data transformation platform.

With $21.8M in funding, Tobiko aims to build a modern data platform

In 1816, French physician René Laennec invented an instrument that allowed doctors to listen to the heart and lungs. That device — a stethoscope — eventually evolved from a simple…

Eko Health scores $41M to detect heart and lung disease earlier and more accurately

The number of satellites on low Earth orbit is poised to explode over the coming years as more mega-constellations come online. This will create new opportunities for bad actors to…

DARPA and Slingshot build system to detect ‘wolf in sheep’s clothing’ adversary satellites