David Dao
daviddao at broad.mit.edu
PhD candidate at ETH Zurich | Stanford | UC Berkeley | MIT Broad
I'm a Ph.D. candidate at ETH Zurich DS3Lab and researcher at Stanford University (re)inventing future machine learning systems. My research leverages Blockchain-based incentives and AI to realize new types of privacy-preserving data systems for sustainability, medicine, and ethics. I'm a co-founder of GainForest, an award-winning non-profit that provides AI-powered conservation tools to prevent deforestation. I was an engineer in Silicon Valley and a research fellow at Berkeley AI Research (BAIR) and Broad Institute of MIT and Harvard. A Global Shaper at World Economic Forum, I organized several large conferences in Germany, Silicon Valley, and at Harvard. My work was featured in MIT Technology Review, The Scientist, The New York Times and at United Nation's Climate Change conference.
In short for millenials:
PhD candidate @DS3Lab. Inventing crazy ML systems.
Co-founder @GainForest. Using 馃洶 and 馃幃to save 馃尨
Goal: Collect every Ivy League 馃摟mail

鉁堬笍 I'm traveling...

Research Medium Blog

Want to change the world and finish your thesis at the same time? Come work with me, here are some thesis proposals!

Trustworthy Machine Learning
Data Valuation: How much is your data worth? 路 Joint project with UC Berkeley
DataBright: Using smart contracts to democratize data and machines
Learning Systems for Sustainability
GainForest: Deforestation markets of the Amazon Rainforest using satellite imagery 路 Microsoft AI for Earth Grant 路 Grand Prize UNFCCC Hack4Climate at COP23 路 Presentation at COP24
Data-driven planning and traffic prediction for self-driving cars 路 Joint project with Mercedes-Benz Research
Privacy-Preserving Learning Systems for Medicine
Kara: A privacy-preserving tokenized data market for medical data 路 Joint project with UC Berkeley and Stanford
Cyto.ai: Interactive machine learning for cell biology 路 Joint project with Broad Institute of MIT and Harvard

Publications Google Scholar


DataBright: Towards a global exchange for decentralized data ownership and trusted computation


Towards Efficient Data Valuation Based on the Shapley Value


A Demonstration of Sterling: A Privacy-Preserving Data Marketplace


An open-source solution for advanced imaging flow cytometry data analysis using machine learning


CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets


Anatomy of BioJS, an open source community for the life sciences


Automated Plausibility Analysis of Large Phylogenies

Open Source Software Github

Almost all of my work is open source

profile for David Dao on Stack Exchange, a network of free, community-driven Q&A sites

StarAwful-AI 馃槇 is a curated list to track current scary usages of AI - hoping to raise awareness
StarAwesome-Deep-Learning 馃敟 is a curated list of papers about very deep neural networks
StarSpatial Transformer 馃寪 is part of TensorFlow Models (where I'm co-author)
StarCellProfiler Analyst 馃敩 is an adaptive machine learning tool for biologists
StarBioJS 馃敩 is an interactive visualization ecosystem for life science

Scientific Collaborators

I'm grateful to work with my scientific collaborators

Dawn Song, Ruoxi Jia, Nick Hynes (UC Berkeley)
Robert Chang (Stanford Medicine)
Anne Carpenter, Allen Goodman (MIT Broad Imaging Platform)
Joe Near (University of Vermont)
Yan Meng (Mercedes-Benz Research)

Former/Current Students

I'm proud of my students

Catherine Cang (UC Berkeley)
Luca Lanzendorfer (ETH / Now at Mercedes-Benz Research)
Florian Chlan (ETH / Now at UBS)
Nino Weingart (ETH)
Christopher Friedrich (Reutlingen / Now at MIT)

Scientific Service

Active member of the scientific community

PC Demo Track SysML'19
Reviewer ISC-HPC'15

You can follow me on Twitter at @dwddao.