MegaPixels
Brainwash Dataset
Brainwash is a dataset of webcam images taken from the Brainwash Cafe in San Francisco
The Brainwash dataset includes 11,917 images of "everyday life of a busy downtown cafe" and is used for training head detection surveillance algorithms

Brainwash Dataset

Brainwash is a dataset of livecam images taken from San Francisco's Brainwash Cafe. It includes 11,917 images of "everyday life of a busy downtown cafe" 1 captured at 100 second intervals throughout the entire day. The Brainwash dataset includes 3 full days of webcam images taken on October 27, November 13, and November 24 in 2014. According the author's research paper introducing the dataset, the images were acquired with the help of Angelcam.com. 2

The Brainwash dataset is unique because it uses images from a publicly available webcam that records people inside a privately owned business without any consent. No ordinary cafe customer would ever suspect that their image would end up in dataset used for surveillance research and development, but that is exactly what happened to customers at Brainwash cafe in San Francisco.

Although Brainwash appears to be a less popular dataset, it was notably used in 2016 and 2017 by researchers affiliated with the National University of Defense Technology in China for two research projects on advancing the capabilities of object detection to more accurately isolate the target region in an image. 3 4 The National University of Defense Technology is controlled by China's top military body, the Central Military Commission.

The dataset also appears in a 2017 research paper from Peking University for the purpose of improving surveillance capabilities for "people detection in the crowded scenes".

 An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The dataset contains a total of 11,917 images and 81,973 annotated heads.  Credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)
An sample image from the Brainwash dataset used for training face and head detection algorithms for surveillance. The dataset contains a total of 11,917 images and 81,973 annotated heads. Credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)
 A visualization of the active regions for 81,973 head annotations in the Brainwash dataset training partition. Credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)
A visualization of the active regions for 81,973 head annotations in the Brainwash dataset training partition. Credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)

Who used Brainwash Dataset?

This bar chart presents a ranking of the top countries where dataset citations originated. Mouse over individual columns to see yearly totals. These charts show at most the top 10 countries.

Biometric Trade Routes

To help understand how Brainwash Dataset has been used around the world by commercial, military, and academic organizations; existing publicly available research citing Brainwash Dataset was collected, verified, and geocoded to show the biometric trade routes of people appearing in the images. Click on the markers to reveal research projects at that location.

Citation data is collected using SemanticScholar.org then dataset usage verified and geolocated.

Dataset Citations

The dataset citations used in the visualizations were collected from Semantic Scholar, a website which aggregates and indexes research papers. Each citation was geocoded using names of institutions found in the PDF front matter, or as listed on other resources. These papers have been manually verified to show that researchers downloaded and used the dataset to trainĀ or test machine learning algorithms. If you use our data, please cite our work.

Supplementary Information

 Nine of 11,917 images from the the Brainwash dataset. Graphics credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)
Nine of 11,917 images from the the Brainwash dataset. Graphics credit: megapixels.cc. License: Open Data Commons Public Domain Dedication (PDDL)

Cite Our Work

If you use our data, research, or graphics please cite our work:

@online{megapixels,
  author = {Harvey, Adam. LaPlace, Jules.},
  title = {MegaPixels: Origins, Ethics, and Privacy Implications of Publicly Available Face Recognition Image Datasets},
  year = 2019,
  url = {https://megapixels.cc/},
  urldate = {2019-04-18}
}

References

  • 1 a"readme.txt" https://exhibits.stanford.edu/data/catalog/sx925dc9385.
  • 2 aStewart, Russel. Andriluka, Mykhaylo. "End-to-end people detection in crowded scenes". 2016.
  • 3 aLi, Y. and Dou, Y. and Liu, X. and Li, T. Localized Region Context and Object Feature Fusion for People Head Detection. ICIP16 Proceedings. 2016. Pages 594-598.
  • 4 aZhao. X, Wang Y, Dou, Y. A Replacement Algorithm of Non-Maximum Suppression Base on Graph Clustering.