I joined the TensorFlow team at Google AI in July 2018.

I received my Ph.D. in Computer Science from Princeton University, advised by Professor Michael J. Freedman. My research focuses on designing scalable, reliable, and highly efficient distributed systems, with topics including datacenter resource scheduling, big-data analytics, distributed machine learning, video stream processing, and in-network computing.

Previously, I interned at Facebook (summer 2017) and Microsoft Research (summer 2015). Before coming to Princeton, I received my B.Sc. in Computer Science from Peking University in 2013. I also received my B.A. in Economics (double major) from the National School of Development.


  me (at) haoyuzhang (dot) org




Google AI

Software Engineer, TensorFlow
07/2018--Present

Princeton University, S* Network Systems (SNS) Group

Ph.D. in Computer Science
09/2013--06/2018

Facebook, Data Infrastructure

Ph.D. Software Engineer Intern
06/2017--08/2017

Microsoft Research, Mobility and Networking Research

Research Intern
06/2015--08/2015

Peking University

B.Sc. in Computer Science, B.A. in Economics
09/2009--06/2013

Technion -- Israel Institute of Technology

Research Exchange Student
09/2012--02/2013


Ph.D. Thesis

Resource Management for Advanced Data Analytics at Large Scale

Haoyu Zhang

Ph.D. Thesis. Princeton University, 2018

[PDF] [Slides] [BibTeX]


Riffle

Riffle: Facebook's Optimized Shuffle Service for Large-Scale Data Analytics

Haoyu Zhang, Brian Cho, Ergin Seyfe, Avery Ching, Michael J. Freedman

European Conference on Computer Systems (EuroSys '18). Porto, Portugal

[PDF] [Slides] [BibTeX] [Spark Summit]


NetChain

NetChain: Scale-Free Sub-RTT Coordination

Xin Jin, Xiaozhou Li, Haoyu Zhang, Nate Foster, Jeongkeun Lee, Robert Soulé, Changhoon Kim, Ion Stoica

USENIX Symposium on Networked Systems Design and Implementation (NSDI '18). Renton, WA, USA

Best Paper Award   [PDF] [Extended Version] [Slides] [BibTeX] [The Morning Paper]


SLAQ Scheduler

SLAQ: Quality-Driven Scheduling for Distributed Machine Learning

Haoyu Zhang, Logan Stafman, Andrew Or, Michael J. Freedman

ACM Symposium on Cloud Computing (SoCC '17). Santa Clara, CA, USA

Best Paper Award   [PDF] [Slides] [BibTeX]

Poster (extended abstract) at the 1st SysML Conference (SysML '18). Stanford, CA, USA


NetCache

NetCache: Balancing Key-Value Stores with Fast In-Network Caching

Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soulé, Nate Foster, Jeongkeun Lee, Changhoon Kim, Ion Stoica

ACM Symposium on Operating Systems Principles (SOSP '17). Shanghai, China

[PDF] [Slides] [BibTeX]


Distributed Video Analytics

Live Video Analytics at Scale with Approximation and Delay-Tolerance

Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, Michael J. Freedman

USENIX Symposium on Networked Systems Design and Implementation (NSDI '17). Boston, MA, USA

[PDF] [Slides] [BibTeX] [Project Website]


Ravana Fault-Tolerant Controller

Ravana: Controller Fault-Tolerance in Software-Defined Networking

Naga Katta, Haoyu Zhang, Michael J. Freedman, Jennifer Rexford

ACM Symposium on SDN Research (SOSR '15). Santa Clara, CA, USA

[PDF] [BibTeX]



Professional Services

2018 | Program Committee: MLSys Workshop at NeurIPS

2018 | Reviewer: JSAC SI-Caching, JSAC SI-SDN Scalability, Trans. Big Data, HPSR

2017 | Program Committee: MobiSys Ph.D. Forum

2017 | Reviewer: ACM TACO, MLSys Workshop at NeurIPS


Other Services

2014--2015 | Publicity Chair, Association of Chinese Students and Scholars at Princeton University (ACSSPU)

2014--2016 | Publicity Chair, Princeton Association of Chinese Entrepreneurs (PACE)

Email: me (at) haoyuzhang (dot) org
 
Phone: (650) [tu: oh θri:] 8976
 
Address: Google Building 41
1600 Amphitheatre Pkwy
Mountain View, CA 94043
 