Media & Entertainment

Meta sues Chinese company’s US subsidiary for scraping Facebook and Instagram data

Comment

Meta's content moderation in Africa in limbo
Image Credits: Bloomberg / Contributor / Getty Images

Facebook’s parent company Meta has announced that it’s suing the U.S. subsidiary of a Chinese tech company, accusing it of offering data-scraping services for Facebook and Instagram.

The social networking giant also revealed that it’s suing an individual, who the company alleges set up automated Instagram accounts to scrape data from some 350,000 Instagram users.

Both cases have been filed in the U.S. District Court for the Northern District of California.

Facebook versus scrapers

While Meta and other internet companies are no strangers to fighting web-scrapers — a practice that involves using automated tools to gather data en-masse from websites — the timing of these latest cases is particularly notable. It comes less than three months after a U.S. court reaffirmed an earlier ruling that web-scraping is legal, the culmination of a long-standing legal battle between Microsoft-owned LinkedIn and a data science company called Hiq Labs, which scraped personal information from LinkedIn to help its customers predict employee attrition.

While the outcome was celebrated by many across the industrial spectrum, including archivists, researchers and journalists who rely on scraping publicly available data, it also dealt a serious blow to legitimate privacy and security concerns around how people’s data can be harnessed without their permission. In this particular case, the court ruled that scraping publicly accessible information does not contravene the Computer Fraud and Abuse Act (CFAA), a cybersecurity law that governs computer hacking in the U.S.

Not to be deterred, Meta is now pursuing similar legal action against a company called Octopus Data, the U.S. offshoot of a “Chinese national high-tech enterprise” — the parent company’s website says that’s it’s called “Shenzhen Vision Information Technology Co.,” and it claims to have launched its core product in 2016.

In addition, Meta confirmed that it’s filing a suit against a Turkey-based individual going by the name of Ekrem Ateş, who allegedly published scraped Instagram data to their own websites, or so-called “clone sites.”

Rather than targeting the entities under the auspices of CFAA, Meta’s pursuing matters via the Digital Millennium Copyright Act (DMCA), which is more concerned with copyright and intellectual property (IP) infringements than hacking. With regards to this, in its court filing Meta specifically points to Section 3 of its terms of service, which state:

You own the intellectual property rights (things such as copyright or trademarks) in any such content that you create and share on Facebook and other Meta Company Products you use. Nothing in these Terms takes away the rights you have to your own content. You are free to share your content with anyone else, wherever you want.

Elsewhere, Facebook’s terms also state that:

You will not collect users’ content or information, or otherwise access Facebook, using automated means (such as harvesting bots, robots, spiders or scrapers) without our prior permission.

According to Meta, Octopus charges its customers a fee to access a software product called Octoparse to launch scraping attacks, or they can also pay Octopus to scrape websites directly. For it to work, customers must give access to their accounts, which allows the software to glean data that’s normally only available to logged-in users, including Facebook friends, email addresses, birth dates, phone numbers andInstagram followers, among other engagement data.

It’s also worth noting that Octoparse is not limited to Meta’s properties, either, with services offered across numerous sites including Twitter, YouTube, Amazon, LinkedIn and more.

“Our lawsuit alleges that Octopus has violated our Terms of Service and the Digital Millennium Copyright Act, by engaging in unauthorized and automated scraping and attempting to conceal their scraping and avoid being detected and blocked from Facebook and Instagram,” Jessica Romero, Meta’s director of platform enforcement and litigation, wrote in a blog post.

These latest instances come shortly after Meta emerged mostly victorious from another data-scraping case it filed some two years ago against an Israeli company called BrandTotal, which offered a browser extension that collected data from Facebook users. The judge in that case sided with Meta in its claim that BrandTotal breached the Facebook terms of use, while it also issued a summary judgement that BrandTotal violated CFAA or California’s CDAFA (Computer Data Access and Fraud Act) by accessing password-protected pages using fake user accounts.

Web-scraping is pretty much as old as the web itself, and it’s not something that will be going away any time soon. However, by targeting some of the worst offenders — both at a corporate and individual level — Meta wants to deter others from following suit.

More TechCrunch

In a series of posts on X on Thursday, Paul Graham, the co-founder of startup accelerator Y Combinator, brushed off claims that OpenAI CEO Sam Altman was pressured to resign…

Paul Graham claims Sam Altman wasn’t fired from Y Combinator

In its three-year history, EthonAI has amassed some fairly high-profile customers including Siemens and chocolate-maker Lindt.

AI manufacturing startup funding is on a tear as Switzerland’s EthonAI raises $16.5M

Don’t miss out: TechCrunch Disrupt early-bird pricing ends in 48 hours! The countdown is on! With only 48 hours left, the early-bird pricing for TechCrunch Disrupt 2024 will end on…

Ticktock! 48 hours left to nab your early-bird tickets for Disrupt 2024

Biotech startup Valar Labs has built a tool that accurately predicts certain treatment outcomes, potentially saving precious time for patients.

Valar Labs debuts AI-powered cancer care prediction tool and secures $22M

Archer Aviation is partnering with ride-hailing and parking company Kakao Mobility to bring electric air taxi flights to South Korea starting in 2026, if the company can get its aircraft…

Archer, Kakao Mobility partner to bring electric air taxis to South Korea in 2026

Space startup Basalt Technologies started in a shed behind a Los Angeles dentist’s office, but things have escalated quickly: soon it will try to “hack” a derelict satellite and install…

Basalt plans to “hack” a defunct satellite to install its space-specific OS

As a teen model, Katrin Kaurov became financially independent at a young age. Aleksandra Medina, whom she met at NYU Abu Dhabi, also learned to manage money early on. The…

Former teen model co-created app Frich to help Gen Z be more realistic about finances

Can an AI help you tell your story? That’s the idea behind a startup called Autobiographer, which leverages AI technology to engage users in meaningful conversations about the events in…

Autobiographer’s app uses AI to help you tell your life story

AI-powered summaries of webpages are a feature that you will find in many AI-centric tools these days. The next step for some of these tools is to prepare detailed and…

Perplexity AI’s new feature will turn your searches into shareable pages

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

A surge of battery recycling startups have emerged in Europe in a bid to tap into the next big opportunity in the EV market: battery waste.  Among them is Cylib,…

Cylib wants to own EV battery recycling in Europe

Amazon has received approval from the U.S. Federal Aviation Administration (FAA) to fly its delivery drones longer distances, the company announced on Thursday. Amazon says it can now expand its…

Amazon gets FAA approval to expand US drone deliveries

With Plannin, creators can tell their audience about their latest trip, which hotels they liked and post photos of their travels.

Former Priceline execs debut Plannin, a booking platform that uses travel influencers to help plan trips

Amazon is rolling out its AI voice search feature to Alexa, which lets it answer open-ended questions about content.

Amazon is rolling out AI voice search to Fire TV devices

Redpanda has already integrated Benthos into its own service and has made it the core technology of its new Redpanda Connect service.

Redpanda acquires Benthos to expand its end-to-end streaming data platform

It’s a lofty goal to take on legacy payments infrastructure, however, Forward’s model has an advantage by shifting the economics back to SaaS companies.

Fintech startup Forward grabs $16M to take on Stripe, lead future of integrated payments

Fertility remains a pressing concern around the world — birthrates are down in many countries, and infertility rates (that is, the ability to conceive at all) are up. And given…

Rhea reaps $10M more led by Thiel

Microsoft, Meta, Intel, AMD and others have formed a new group to design next-gen interconnects for AI accelerator hardware.

Tech giants form an industry group to help develop next-gen AI chip components

With JioFinance, the Indian tycoon Mukesh Ambani is making his boldest consumer-facing move yet into financial services.

Ambani’s Reliance fires opening salvo in fintech battle, launches JioFinance app

Salespeople live and die by commissions. It’s no surprise, then, that Salesforce paid a premium to buy a platform that simplifies managing commissions.

Filing shows Salesforce paid $419M to buy Spiff in February

YoLa Fresh works with over a thousand retailers across Morocco and records up to $1 million in gross merchandise volume.

YoLa Fresh, a GrubMarket for Morocco, digs up $7M to connect farmers with food sellers

Instagram is expanding the scope of its “Limits” tool specifically for teenagers that would let them restrict unwanted interactions with people.

Instagram now lets teens limit interactions to their ‘Close Friends’ group to combat harassment

Agritech company Iyris helps growers across eleven countries globally increase crop yields, reduce input costs, and extend growing seasons.

Iyris makes fresh produce easier to grow in difficult climates, raises $16M

Exactly.ai says it uses generative AI to help artists retain legal ownership of their art while being able to reproduce their designs faster and at scale.

Exactly.ai secures $4M to help artists use AI to scale up their output

FintechOS competes with other companies such as Ncino, Meridian Link, Abrigo and Backbase.

Romanian startup FintechOS raises $60M to help old banks fight back against neobanks

After two years of preparation and four delays over the past several months due to technical glitches, Indian space startup Agnikul has successfully launched its first sub-orbital test vehicle, powered…

India’s Agnikul launches 3D-printed rocket in sub-orbital test after initial delays

Struggling EV startup Fisker has laid off hundreds of employees in a bid to stay alive, as it continues to search for funding, a buyout or prepare for bankruptcy. Workers…

Fisker cuts hundreds of workers in bid to keep EV startup alive

Chinese EV manufacturers face a new challenge in their pursuit of U.S. customers: a new House bill that would limit or ban the introduction of their connected vehicles. The bill,…

Chinese EV makers, and their connected vehicles, targeted by new House bill

With the release of iOS 18 later this year, Apple may again borrow ideas third-party apps. This time it’s Arc that could be among those affected.

Is Apple planning to ‘sherlock’ Arc?

TechCrunch Disrupt 2024 will be in San Francisco on October 28–30, and we’re already excited! This is the startup world’s main event, and it’s where you’ll find the knowledge, tools…

Meet Visa, Mercury, Artisan, Golub Capital and more at TC Disrupt 2024