When Facebook Knows You Better Than You Know Yourself


Image Credits: vigneshkumar / Wikimedia Commons under a CC BY 2.0 license.

Jon Evans

Contributor

Jon Evans is the CTO of the engineering consultancy HappyFunCorp; the award-winning author of six novels, one graphic novel, and a book of travel writing; and TechCrunch’s weekend columnist since 2010.


Every time you log in to Facebook, every time you click on your News Feed, every time you Like a photo, every time you send anything via Messenger, you add another data point to the galaxy they already have regarding you and your behavior. That, in turn, is a tiny, insignificant dot within their vast universe of information about their billion-plus users.

It is probable that Facebook boasts the broadest, deepest, and most comprehensive dataset of human information, interests, and activity ever collected. (Only the NSA knows for sure.) Google probably has more raw data, between Android and searches, but the data they collect is (mostly) much less personal. Of all the Stacks, I think it’s fair to say, Facebook almost certainly knows you best.

They can use this data for advertising, which is contentious, I suppose; but much worse, it’s boring. What’s long been more interesting to me is the possibility of interpolating from this data, i.e. deducing from your online behavior things that you never explicitly revealed to Facebook, and extrapolating from it, i.e. predicting your reactions to new information and new situations. What’s interesting is the notion that Facebook might be able to paint an extraordinarily accurate pointillist picture of you, with all the data points you give it as the pixels.

That’s pretty abstract. Let’s try a couple of concrete examples. Imagine that Facebook could figure out with a high degree of confidence, from the way you use its app and site, from the links and photos you post, the apps you use, and the stuff you Like, whether you’re a hard worker or a shirker, and whether you’re a good or bad credit/insurance risk. Interesting stuff, to a would-be employer and/or a would-be insurer, no?

And not nearly as futuristic as it may sound. Your phone can tell whether you’re depressed. Algorithms are already being used to judge our character, and can determine whether your relationship is in trouble based on your collective social graph.

And Facebook just keeps expanding its remit of data. As of this week, you can search all of its trillions of posts — meaning that it can and will add more and more search data to what it knows.

One wonders whether, and how much, it will actually use this data, though. After all, if and when people discover that they inadvertently reveal things they may wish to keep private by simply being themselves on Facebook … they may well decide to stop being themselves on Facebook. Which will mean less candor, less sharing, more forethought and judiciousness — and less time spent on Facebook.

On the other hand, instead of making it clear what they know about us all, they may well simply use this information in an opaque way, to continue increasing their reach and their profits, in which case Facebook will become a kind of one-way mirror, one that may ultimately literally know you better than you know yourself. Which in turn raises fascinating and disturbing ethical questions worthy of a Philip K. Dick (or Kafka) novel: what if Facebook’s deep neural networks predict, based on your behavior, that you’re going to commit suicide? What if they predict that you’re going to kill someone else? What if they have 90% confidence? What if they’re wrong?

I don’t pretend to have the answers. But I think it’s worth considering the possibility that human data on this scale will in the not-too-distant future act as both an X-ray, revealing things about ourselves that we had thought secret, and a searchlight, illuminating what we’re likely to do next.
