David Dao
daviddao at broad.mit.edu
PhD candidate at ETH Zurich | UC Berkeley | MIT Broad
I'm a Ph.D. student at ETH Zurich DS3Lab developing systems to scale human cooperation. My research leverages blockchain-based AI to realize new types of privacy-preserving incentives for sustainability, medicine, and ethical AI 🔈. I was an engineer in Silicon Valley and a research fellow at Berkeley AI Research (BAIR) and the MIT Broad Institute. I'm a Global Shaper at the World Economic Forum, and I organize several large conferences in Germany, in Silicon Valley, and at Harvard. My work has been featured in MIT Technology Review, The Scientist, and The New York Times.

🛫 I'm traveling quite a lot.

Research Medium Blog

Want to change the world and finish your thesis at the same time? Come work with me! Here are some thesis proposals:

Trustless Machine Learning
Data Valuation: How much is your data worth? · Joint project with UC Berkeley
DataBright: Using smart contracts to democratize data and machines
Data Systems for Sustainability
GainForest: Deforestation markets of the Amazon Rainforest using satellite imagery · Microsoft AI for Earth Grant · Grand Prize UNFCCC Hack4Climate at COP23
Data-driven planning and traffic prediction for self-driving cars · Joint project with Mercedes-Benz Research
Privacy-Preserving Systems for Medicine
Kara: A privacy-preserving tokenized data market for medical data · Joint project with UC Berkeley
Cyto.ai: Interactive machine learning for cell biology · Joint project with Broad Institute of MIT and Harvard

Publications Google Scholar


“How Much is My Data Worth?”: Data Valuation with Efficient Shapley Value Estimation
DataBright: Towards a global exchange for decentralized data ownership and trusted computation


A Demonstration of Sterling: A Privacy-Preserving Data Marketplace


An open-source solution for advanced imaging flow cytometry data analysis using machine learning


CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets


Anatomy of BioJS, an open source community for the life sciences


Automated Plausibility Analysis of Large Phylogenies

Open Source Software GitHub

Almost all of my work is open source

Awful-AI 😈 is a curated list that tracks current scary usages of AI, hoping to raise awareness
Awesome-Deep-Learning 🔥 is a curated list of papers about very deep neural networks
Spatial Transformer 🌐 is part of TensorFlow Models (where I'm a co-author)
CellProfiler Analyst 🔬 is an adaptive machine learning tool for biologists
BioJS 🔬 is an interactive visualization ecosystem for the life sciences

Scientific Collaborators

I'm grateful to work with my scientific collaborators

Dawn Song, Ruoxi Jia, Nick Hynes (UC Berkeley)
Robert Chang (Stanford Medicine)
Anne Carpenter, Allen Goodman (MIT Broad Imaging Platform)
Joe Near (University of Vermont)
Yan Meng (Mercedes-Benz Research)

Former/Current Students

I'm proud of my students

Catherine Cang (UC Berkeley)
Jeffrey Liu (UC Berkeley)
Luca Lanzendorfer (ETH / Now at Mercedes-Benz Research)
Florian Chlan (ETH / Now at UBS)
Nino Weingart (ETH)
Christopher Friedrich (Reutlingen / Now at MIT)

You can follow me on Twitter at @dwddao.