Leaked Facebook ads document raises fresh questions over GDPR enforcement

Comment

Facebook and Meta logos
Image Credits: Chesnot / Getty Images

Motherboard/Vice had an explosive report on Facebook’s business yesterday that’s sure to raise fresh questions over the lack of enforcement of European privacy laws against the adtech giant.

The report is based on a leaked internal document written last year by privacy engineers on its Ad and Business product team.

The document, which is entitled “ABP Privacy Infra, Long Range Investments [A/C Priv],” appears to show engineers at the tech giant now known as Meta scratching their heads at the nightmarish task they’re facing: Trying to make Facebook’s data-ingesting ads business compliant with a “tsunami” of global privacy regulations that need it to know how user data flows through its systems so the company can apply policies that control what’s done with people’s information and perform basic stuff like reflect people’s privacy choices. So next time Sheryl Sandberg talks about Meta’s “regulatory headwinds” this is the contextual meat to graft on those euphemistic bones.

Meta’s text deploys some internal business shorthand/acronyms whose literal meanings aren’t always clear. But the gist of the read — and it’s worth reading in full if you can spare the time for 15-pages of text, diagrams and a few colorful analogies such as one comparing a person’s information to a bottle of ink being poured into a giant lake (oopsy!) — is that Meta has ‘designed’ its ad system in such a totally unsiloed way that it’s very, very, very far from being able to comply with (even existing) laws like Europe’s General Data Protection Regulation (GDPR) which has a purpose limitation principle meaning you need a legal basis for each use of personal data. Nor, per the document, do Meta’s engineers sound confident of being able to transform the mess and achieve timely compliance with a bunch of other, incoming global regulations either. (And don’t even get them started on what AI regulations might mean for the business.)

Meta disputes that the document shows non-compliance with any privacy laws, of course.

In a statement to Motherboard, the company claims the document “does not describe our extensive processes and controls to comply with privacy regulations”; adding therefore that “it’s simply inaccurate to conclude that it demonstrates non-compliance”; and further claiming: “New privacy regulations across the globe introduce different requirements and this document reflects the technical solutions we are building to scale the current measures we have in place to manage data and meet our obligations.” 

But, well, they would say that, wouldn’t they? 

Independent privacy researcher, Wolfie Christl — an expert in forensic analysis of ad data flows — takes a different view of what the leaked document reveals — dubbing it “dynamite” and a “confession” (albeit one not intended by Meta for public consumption) that it does not comply with the GDPR. See his detailed Twitter thread here — where he unpacks and contextualizes the implications of the engineers’ observations, as he sees it.

“The document is a straight and clear confession that Facebook’s whole business is based on a massive GDPR violation at the most fundamental level,” Christl tells TechCrunch. “Purpose limitation is one of the most basic principles in the GDPR. A company can generally only collect personal data for a specified purpose. If a company cannot specify the purpose it collects personal data for, it is simply not allowed to process it under the GDPR.”

Asked what Meta’s lead data protection regulator in the EU should do, Christl adds: “The Irish regulator must take action now. If Facebook cannot make clear how exactly its surveillance advertising machine uses personal data, it must be ordered to stop processing it.”

TechCrunch contacted the Irish Data Protection Commission (DPC) to ask whether it will be opening an investigation into Meta’s ad data flows in light of what the document appears to show is, basically, an ads system that, either by design or systemic build creep, exists (or existed in 2021) in a state that’s antithetical to regulation — and, indeed, whether the document is of relevance to any of the (several) ongoing investigations it has into aspects of Facebook’s business.

The regulator did not provide a statement but deputy commissioner Graham Doyle confirmed it had only seen the document for the first time when Motherboard/Vice published it.

That may raise further questions, given the DPC has — on paper — been investigating whether Facebook’s ads business complies with the GDPR’s requirement to have a valid legal basis for processing people’s data for almost four years now.

For example, the DPC has been considering a complaint against Facebook, focused on its legal basis for processing user data for ads, since May 2018, when the regulation entered into force.

A draft DPC decision on that inquiry, which was published (not by the DPC) last fall, was quickly branded a joke by privacy campaigners as the regulator appeared to be intending to accept a tactic by Meta to evade the GDPR’s standard for consent-based processing by claiming a cunning contractual bypass.

The tl;dr here is that for consent to be valid under the GDPR, data subjects must be given a free choice. Consent must also be purpose specific (aka no bundling); and it must be informed.

None of which happens if you use Facebook — where the platform makes processing your information for ad targeting a condition of use. Click ‘agree to ads’ or no Facebook account for you.

But, per last year’s leaked draft DPC decision, Facebook claims users are actually in a contract with it to receive targeted ads — and the DPC didn’t appear to see reason to object to that GDPR-bypassing construction.

Given GDPR complaints are still floundering on such legal basics, is it any wonder that the deep, dark, underbelly of Meta’s ad-targeting machinery contains, as this document tells it, a vast ocean of surveillance data on web users but so little apparatus to order this information according to people’s own wishes?

The bottom line is that the EU is almost four years into enforcement of its ‘flagship’ data protection regime and Facebook itself remains untouched by GDPR enforcement. (Its messaging platform WhatsApp was hit by a fine last year.)

The European Union also didn’t suddenly invent privacy regulation in 2018, when the GDPR came into force. Before that law there was the Data Protection Directive, which included many of the same principles.

So — in Europe at least — if a company like Facebook had actually been paying attention to legal requirements around privacy by design — and if EU regulators had been muscularly enforcing these long-standing rules — Meta might not now be warning investors about the ‘regulatory headwinds’ coming for their shareholder value. Nor facing what sounds to be a monumentally expensive and resource intensive re-engineering challenge — not so much akin to landing on the moon as more like needing to reconstruct the whole of the planet from pulverized moondust in a way that ensures every tiny piece of rock and dust is put back in exactly the place it originated for. Oh, and — guess what! — the deadline for doing all that already passed. Call it the ‘Zuckerberg’s moonshot.’

A Meta spokesperson did not respond to a question asking whether, following the Motherboard report, it had contacted the DPC to provide its lead EU regulator with information on how its ads system functions.

The company sent us the same statement it provided Motherboard earlier, which concludes with this lament: “This analogy lacks the context that we do, in fact, have extensive processes and controls to manage data and comply with privacy regulations.”

The European Commission is ultimately responsible for monitoring the application of the GDPR by EU Member State agencies.

We asked the Commission if it had any concerns in light of the leaked document and/or a view on whether the DPC should open an investigation into Meta’s ads data flows. But at the time of writing it had not responded.

In February, following a complaint against the Commission by the Irish Council for Civil Liberties — which accuses the EU’s executive of neglecting its duty to act on Ireland’s “failure to properly apply” the GDPR — the EU’s ombudsperson opened an inquiry — giving the Commission until May 15 to provide it with a “detailed and comprehensive” account of the information it has collected so far around whether the regulation is applied “in all respects” in Ireland.

Ireland’s draft GDPR decision against Facebook branded a joke

More TechCrunch

Blue Origin’s New Shepard rocket will take a crew to suborbital space for the first time in nearly two years later this month, the company announced on Tuesday.  The NS-25…

Blue Origin to resume crewed New Shepard launches on May 19

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

In the coming months, Google says it will open up the Gemini Nano model to more developers.

Patreon and Grammarly are already experimenting with Gemini Nano, says Google

As part of the update, Reddit also launched a dedicated AMA tab within the web post composer.

Reddit introduces new tools for ‘Ask Me Anything,’ its Q&A feature

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

LearnLM is already powering features across Google products, including in YouTube, Google’s Gemini apps, Google Search and Google Classroom.

LearnLM is Google’s new family of AI models for education

The official launch comes almost a year after YouTube began experimenting with AI-generated quizzes on its mobile app. 

Google is bringing AI-generated quizzes to academic videos on YouTube

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: Watch all of the AI, Android reveals

Google Play has a new discovery feature for apps, new ways to acquire users, updates to Play Points, and other enhancements to developer-facing tools.

Google Play preps a new full-screen app discovery feature and adds more developer tools

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps.

Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more

Veo can capture different visual and cinematic styles, including shots of landscapes and timelapses, and make edits and adjustments to already-generated footage.

Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024

In addition to the body of the emails themselves, the feature will also be able to analyze attachments, like PDFs.

Gemini comes to Gmail to summarize, draft emails, and more

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors.

Google is bringing Gemini capabilities to Google Maps Platform

Google says that over 100,000 developers already tried the service.

Project IDX, Google’s next-gen IDE, is now in open beta

The system effectively listens for “conversation patterns commonly associated with scams” in-real time. 

Google will use Gemini to detect scams during calls

The standard Gemma models were only available in 2 billion and 7 billion parameter versions, making this quite a step up.

Google announces Gemma 2, a 27B-parameter version of its open model, launching in June

This is a great example of a company using generative AI to open its software to more users.

Google TalkBack will use Gemini to describe images for blind people

Google’s Circle to Search feature will now be able to solve more complex problems across psychics and math word problems. 

Circle to Search is now a better homework helper

People can now search using a video they upload combined with a text query to get an AI overview of the answers they need.

Google experiments with using video to search, thanks to Gemini AI

A search results page based on generative AI as its ranking mechanism will have wide-reaching consequences for online publishers.

Google will soon start using GenAI to organize some search results pages

Google has built a custom Gemini model for search to combine real-time information, Google’s ranking, long context and multimodal features.

Google is adding more AI to its search results

At its Google I/O developer conference, Google on Tuesday announced the next generation of its Tensor Processing Units (TPU) AI chips.

Google’s next-gen TPUs promise a 4.7x performance boost

Google is upgrading Gemini, its AI-powered chatbot, with features aimed at making the experience more ambient and contextually useful.

Google’s Gemini updates: How Project Astra is powering some of I/O’s big reveals

Veo can generate few-seconds-long 1080p video clips given a text prompt.

Google’s image-generating AI gets an upgrade

At Google I/O, Google announced upgrades to Gemini 1.5 Pro, including a bigger context window. .

Google’s generative AI can now analyze hours of video

The AI upgrade will make finding the right content more intuitive and less of a manual search process.

Google Photos introduces an AI search feature, Ask Photos

Apple released new data about anti-fraud measures related to its operation of the iOS App Store on Tuesday morning, trumpeting a claim that it stopped over $7 billion in “potentially…

Apple touts stopping $1.8B in App Store fraud last year in latest pitch to developers