Startups

Sweep aims to automate basic dev tasks using large language models

Comment

Software engineers programmer development coding a solution data.
Image Credits: Chalirmpoj Pimpisarn / EyeEm / Getty Images

Developers spend a lot of time on mundane, repetitive tasks — and surprisingly little on actual coding.

In Stack Overflow’s 2022 developer survey, 63% of respondents said that they devote more than 30 minutes a day searching for answers or solutions to problems — which adds up to between 333 to 651 hours of time lost per week across a team of 50 developers. A separate poll from Propeller Insights and Rollbar found that over a third of developers spend around a quarter of their time fixing bugs, with slightly more than a quarter (26%) setting aside up to half their time fixing bugs.

The trend frustrated William Zeng and Kevin Lu. So earlier this year, they — both veterans of Roblox, the video-game-turned-social-network — created a platform called Sweep to autonomously handle dev tasks like high-level debugging.

“We started Sweep after working at Roblox together and constantly dealing with software chores we knew could be automated with AI,” Zeng, Sweep’s CEO, told TechCrunch in an email interview. “Sweep is like an AI-powered junior dev for software teams.”

TechCrunch previously covered Sweep during Y Combinator’s Summer 2023 Demo Day. But since then, the startup has closed a new financing round, raising $2 million from Goat Capital, Replit CEO Amjad Masad, Replit VP of AI Michele Catasta and Exceptional Capital at a $25 million post-money valuation.

Sweep allows devs to describe a request in natural language — for example, “add debug logs to my data pipeline” — outside of an IDE and generate the corresponding code. The platform can then push that code to the appropriate codebase via a pull request, and address comments made on the pull request either from code maintainers or owners — a bit like GitHub Copilot, but more autonomous.

“Sweep allows engineers to ship faster,” Zeng said. “We’ll handle tech debt accumulated with every code change, such as improving error logs and adding unit tests in addition to refactoring inefficient code.”

Sweep, which specializes in writing Python code, leverages a combination of AI models for code generation. They include OpenAI’s GPT-4, but also a custom “code search engine” — importantly not trained on Sweep customer data, Zeng says — that helps plan and execute “repository-wide” code changes.

“We built our own code search engine for Python, which leverages lexical and vector search techniques,” Zeng added. Lexical search looks for literal matches — or slight variations on — portions of code, while vector search can find more loosely related code that shares certain characteristics. “We have one of the best unit test generation abilities available and will run and execute tests in real time,” he continued.

In the future, Sweep plans to beef up its platform’s code generation capabilities with StarCoder, the open source code-generating model from Hugging Face and ServiceNow.

Given AI’s tendency to make mistakes, though, I’m a little skeptical of Sweep’s reliability over the long run. A Stanford-affiliated research team found that engineers who use AI tools are more likely to cause security vulnerabilities in their apps because the tools often generate code that appears to be superficially correct but poses security issues.

There’s also the copyright question. Some code-generating models — not necessarily StarCoder or Sweep’s own, but others — are trained on copyrighted or code under a restrictive license, and these models can regurgitate this code when prompted in a certain way. Legal experts have argued that these tools could put companies at risk if they were to unwittingly incorporate copyrighted suggestions from the tools into their production software.

Sweep’s solution is prompting users to review and edit any generated code themselves before pushing changes to the target master codebase.

“The main challenges affecting AI developer tools are around reliability and managing large codebases,” Zeng said. “We’re using our knowledge around both older and newer methods to make Sweep robust.”

Sweep charges a pretty penny for its services — $480 per seat per month. (By contrast, the business-focused tiers for GitHub Copilot and Amazon CodeWhisperer cost around $20 per user per month.) But that hasn’t dissuaded customers apparently. Zeng claims that Sweep, with a rather humble war chest totaling $2.8 million, has enough capital coming in from clientele to “last the company years.”

“The new money will be for expanding our team in the coming year from two employees to five,” he continued. “We’re going to continue focusing on Python, and improving across all areas of tech debt from unit testing, refactoring and handling leftover to-dos in the code.”

More TechCrunch

After two years of preparation and four delays over the past several months due to technical glitches, Indian space startup Agnikul has successfully launched its first sub-orbital test vehicle, powered…

India’s Agnikul launches 3D-printed rocket in sub-orbital test after initial delays

Struggling EV startup Fisker has laid off hundreds of employees in a bid to stay alive, as it continues to search for funding, a buyout or prepare for bankruptcy. Workers…

Fisker cuts hundreds of workers in bid to keep EV startup alive

Chinese EV manufacturers face a new challenge in their pursuit of U.S. customers: a new House bill that would limit or ban the introduction of their connected vehicles. The bill,…

Chinese EV makers, and their connected vehicles, targeted by new House bill

With the release of iOS 18 later this year, Apple may again borrow ideas third-party apps. This time it’s Arc that could be among those affected.

Is Apple planning to ‘sherlock’ Arc?

TechCrunch Disrupt 2024 will be in San Francisco on October 28–30, and we’re already excited! This is the startup world’s main event, and it’s where you’ll find the knowledge, tools…

Meet Visa, Mercury, Artisan, Golub Capital and more at TC Disrupt 2024

Featured Article

The women in AI making a difference

As a part of a multi-part series, TechCrunch is highlighting women innovators — from academics to policymakers —in the field of AI.

12 hours ago
The women in AI making a difference

Cadillac may seem a bit too traditional to hang its driving cap on EVs. And yet, that hasn’t stopped the GM brand from rolling out — or at least showing…

The Cadillac Optiq EV starts at $54,000 and is designed to hook young hipsters

Ifeel is being offered as part of an employer’s or insurance provider’s healthcare coverage.

Mental health insurance platform ifeel raises a $20 million Series B

Instead of opening the user’s actual browser or a WebView, Custom Tabs let users remain in their app while browsing.

Google Chrome becomes a ‘picture-in-picture’ app

Sanil Chawla remembers the meetings he had with countless artists in college. Those creatives were looking for one thing: sustainable economic infrastructure that could help them scale rather than drown…

Slingshot raises $2.2 million to provide financial services to artists

A startup called Firefly that’s tackling the thorny and growing issue of cloud asset management with an “infrastructure as code” solution has raised $23 million in funding. That comes on…

Firefly forges on after co-founder murdered by Hamas

Mistral, the French AI startup backed by Microsoft and valued at $6 billion, has released its first generative AI model for coding, dubbed Codestral. Like other code-generating models, Codestral is…

Mistral releases Codestral, its first generative AI model for code

Pinterest announced today that it is evolving its Creator Inclusion Fund to now be called the Pinterest Inclusion Fund. Pinterest teamed up with Shopify’s Build Black and Build Native programs…

Pinterest expands its Creator Fund to allow founders

Alex Taub, a longtime founder with multiple exits under his belt, believes it’s time to disrupt the meme industry. “I have this big thesis that meme tech is going to…

This founder says meme tech is the next big thing

Lux, the startup behind popular pro photography app Halide and others, is venturing into video with its latest app launch. On Wednesday, the company announced Kino, a new video capture app…

Kino is a new iPhone app for videographers from the makers of Halide

DevOps startup Harness has shown itself to be an ambitious company, building a broad platform of services while also dabbling in M&A when it made sense to fill in functionality.…

Harness snags Split.io as it goes all in on feature flags and experiments

Microsoft’s Copilot, a generative AI-powered tool that can generate text as well as answer specific questions, is now available as an in-app chatbot on Telegram, the instant messaging app.  Currently…

Microsoft’s Copilot is now on Telegram

HBO’s new documentary, “MoviePass, MovieCrash,” tells a story that many of us know about: how MoviePass, the subscription-based movie ticketing startup, was a catastrophic failure. After a series of mishaps…

MoviePass co-founders speak their truth in HBO’s new documentary 

The watch features a variety of different 3D games, unlocking more play time the more kids move.

Fitbit’s new kid smartwatch is a little Wiimote, a little Tamagotchi

In the video, a crowd is roaring at a packed summer music festival. As a beat starts playing over the speakers, the performer finally walks onstage: It’s the Joker. Clad…

Discord has become an unlikely center for the generative AI boom

After the Wirecard scandal, Germany’s financial regulator BaFin started to look more closely at young fintech startups that wanted to grow at a rapid pace — it’s better to be…

Germany’s financial regulator ends anti-money laundering cap on N26 signups after $10M fine

Among other things, this includes the ability to trace code from source to binary packages across both platforms, single sign-on support and unified project structures.

JFrog and GitHub team up to closely integrate their source code and binary platforms

The company’s public fund disbursement and e-commerce platform makes accepting school tuition and enabling educational enrichment more accessible. 

Tech startup Odyssey goes on journey to help states implement school choice programs

A new startup called Kinnect aims to help people privately save generational memories, traditions, recipes and more. The company’s app, launched this month, lets people create invite-only spaces where they…

Kinnect’s new app aims to help families record and store generational memories

Spotify has hiked its premium subscription in France by an eye-watering €0.13, in response to a new music-streaming tax.

Spotify hikes subscription price in France by 1.2% to match new music-streaming tax

The European Union has taken the wraps off the structure of the new AI Office, the ecosystem-building and oversight body that’s being established under the bloc’s AI Act. The risk-based…

With the EU AI Act incoming this summer, the bloc lays out its plan for AI governance

Solutions by Text, a company that gives people a way to pay their bills and apply for loans via text messaging, has secured $110 million in new growth funding. Edison…

Bootstrapped for over a decade, this Dallas company just secured $110M to help people pay bills by text

Owners of small- and medium-sized businesses check their bank balances daily to make financial decisions. But it’s entrepreneur Yoseph West’s assertion that there’s typically information and functions missing from bank…

Relay raises $32.2 million to help smaller businesses manage their cash flow

When other firms were investing and raising eye-popping sums, Clean Energy Ventures took a different approach. It appears to be paying off.

How Clean Energy Ventures avoided the pandemic bubble and raised a $305M fund

PwC, the management consulting giant, will become OpenAI’s biggest customer to date, covering 100,000 users.

OpenAI signs 100K PwC workers to ChatGPT’s enterprise tier as PwC becomes its first resale partner