PhD student, Computer Systems Lab, Cornell University

Neil Adit

🕿 na469@cornell.edu | 🏾 neiladit.com | 🖸 github.com/neiladit

I am a research scientist at Meta working on developing efficient foundational recommendation systems at scale. Previously, I was a student researcher at Google and PhD candidate at Cornell, working on hardware-software co-design for datacenter workloads. My research interests lie at the intersection of Efficient Machine Learning, Compilers and Computer Architecture.

## Education\_

| Cornell University                                                                                                                                                                                   | Aug 2018 - June 2024  |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| PhD in Computer Engineering                                                                                                                                                                          | GPA: 4/4              |
| <ul> <li>Committee: Adrian Sampson (chair), Zhiru Zhang, Chris De Sa</li> <li>Courses: Advanced ML Systems, Computer Vision, Advanced Compilers, Datacenter Computing, Parallel Computing</li> </ul> |                       |
| Indian Institute of Technology, Bombay                                                                                                                                                               | July 2013 - June 2018 |
| B.Tech + M.Tech in Electrical Engineering                                                                                                                                                            | GPA: 9.05/10          |
| <ul> <li>Masters in Microelectronics, Minor in Computer Science</li> <li>Advisor: Sachin Patkar</li> </ul>                                                                                           |                       |
| Publications                                                                                                                                                                                         |                       |
| Performance Left on the Table: An Evaluation of Compiler Auto-Vectorization for RISC-V                                                                                                               | IEEE Micro 2022       |
| <u>Neil Adit</u> and Adrian Sampson                                                                                                                                                                  |                       |
| Software-Defined Vector Processing on Manycore Fabrics                                                                                                                                               | MICRO 2021            |
| Philip Bedoukian, <u>Neil Adit</u> , Edwin Peguero, Adrian Sampson                                                                                                                                   |                       |
| Dense Pruning of Pointwise Convolutions in the Frequency Domain                                                                                                                                      | arxiv preprint 2021   |
| Mark Buckler, <u>Neil Adit</u> , Yuwei Hu, Zhiru Zhang, and Adrian Sampson                                                                                                                           |                       |
| Dagger: Efficient and Fast RPCs in Cloud Microservices with Near-Memory Reconfigurable<br>NICs                                                                                                       | ASPLOS 2021           |
| Nikita Lazarev, Shaojie Xiang, <u>Neil Adit</u> , Zhiru Zhang, Christina Delimitrou                                                                                                                  |                       |
| Dagger: Towards Efficient RPCs in Cloud Microservices with Near-Memory                                                                                                                               |                       |
| Reconfigurable NICs                                                                                                                                                                                  | IEEE CAL 2020         |
| Nikita Lazarev, <u>Neil Adit</u> , Shaojie Xiang, Zhiru Zhang, and Christina Delimitrou                                                                                                              |                       |
| Industry Experience                                                                                                                                                                                  |                       |

| Meta                                                                                                                                                                                                                                                                          | Menlo Park, CA, USA        |  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------|--|
| Research Scientist                                                                                                                                                                                                                                                            | July 2024 - Present        |  |
| Developing efficient large recommendation system models , optimized for various hardware backends                                                                                                                                                                             |                            |  |
| Google                                                                                                                                                                                                                                                                        | Sunnyvale, CA, USA         |  |
| Student Researcher   Advisors: Akanksha Jain and Snehasish Kumar                                                                                                                                                                                                              | May 2022 - May 2024        |  |
| Developed a hardware-software co-design infrastructure for GSoC performance optimization, including profile-driven compiler analysis<br>Demonstrated performance improvements via microarchitectural modifications in cycle-accurate simulator, modeling datacenter behaviour |                            |  |
| Microsoft Research                                                                                                                                                                                                                                                            | Redmond, WA, USA           |  |
| Research Intern   Advisor: Ofer Dekel                                                                                                                                                                                                                                         | May 2021 - Aug 2021        |  |
| <ul> <li>Developed algorithms to accelerate sparse ML models on commodity hardware in the Machine Learning and Opt</li> <li>Demonstrated wall-clock speedups on sparse kernel execution using the ONNX runtime library backend</li> </ul>                                     | imization group, at MSR    |  |
| Intel Labs                                                                                                                                                                                                                                                                    | Santa Clara, CA, USA       |  |
| Graduate Research Intern   Advisor: Fabrizio Petrini                                                                                                                                                                                                                          | May 2019 - Aug 2019        |  |
| Designed and implemented high performance computing algorithms for sparse computations on Intel's breakthr                                                                                                                                                                    | ough research architecture |  |
| SIEMENS Research                                                                                                                                                                                                                                                              | Bangalore, India           |  |
| Summer Intern   Advisors: Dr. Amit Kale and Prabhu Teja                                                                                                                                                                                                                       | May 2016 - Jul. 2016       |  |
|                                                                                                                                                                                                                                                                               |                            |  |

## Designed and demonstrated Kidney segmentation in CT images for clinical diagnosis using Laplacian Mesh Deformation

## Research Experience

| Compile                                                   | r Auto-Vectorization for Scalable Vectors                                                                                                                                                                                                                                                                                              | Ithaca, NY, USA       |
|-----------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|
| ADVISOR: P                                                | ROF. ADRIAN SAMPSON, CORNELL UNIVERSITY                                                                                                                                                                                                                                                                                                | Aug. 2021 - May 2022  |
| <ul> <li>Designed</li> </ul>                              | ed backend-independent ScaleIR for arbitrary representations to optimize instruction selection and hardware pe                                                                                                                                                                                                                         | rformance             |
| Softwar                                                   | e-Defined Vectors on Manycore                                                                                                                                                                                                                                                                                                          | Ithaca, NY, USA       |
| Advisor: P                                                | rof. Adrian Sampson, Cornell University                                                                                                                                                                                                                                                                                                | Jan. 2019 - Aug. 2020 |
| • Worked                                                  | with Philip Bedoukian on vector programming model that allows dynamic reconfiguration of manycore tiles int                                                                                                                                                                                                                            | o vector engines      |
| Frequen                                                   | cy Domain Dense Pruning of Pointwise Convolutions                                                                                                                                                                                                                                                                                      | Ithaca, NY, USA       |
| Advisor: P                                                | rof. Adrian Sampson, Cornell University                                                                                                                                                                                                                                                                                                | Aug. 2018 - Sep. 2021 |
| • Worked                                                  | with Mark Buckler on exploiting spatial redundancy in depthwise convolutions by pruning in the frequency dom                                                                                                                                                                                                                           | nain                  |
| Near-Me                                                   | mory Reconfigurable NICs                                                                                                                                                                                                                                                                                                               | Ithaca, NY, USA       |
| Advisor: P                                                | ROF. CHRISTINA DELIMITROU, CORNELL UNIVERSITY                                                                                                                                                                                                                                                                                          | Jan. 2020 - Aug. 2020 |
| <ul><li>We offle</li><li>Designed</li></ul>               | bad the RPC stack on a FPGA which is tightly coupled with the host CPU via memory interconnects, Intel UPI in th<br>ed queue management for asynchronously sending packets in a single connection                                                                                                                                      | is case               |
| Accelera                                                  | ting 1x1 Convolutions using Systolic Arrays                                                                                                                                                                                                                                                                                            | Ithaca, NY, USA       |
| Advisor: P                                                | rof. Zhiru Zhang, Cornell University                                                                                                                                                                                                                                                                                                   | Oct. 2018 - Dec. 2018 |
| <ul><li>Implem</li><li>Optimiz</li><li>Achieve</li></ul>  | nented pointwise convolutions in MobileNets on Zynq ZC-706 using systolic arrays.<br>2ed streaming of input activations using quantization, bit packing and padding.<br>2d close to ideal, 215x speedup using 16x16 parallel PEs for systolic array architecture.                                                                      |                       |
| Parallel                                                  | Sparse Matrix Solution on FPGA                                                                                                                                                                                                                                                                                                         | Mumbai, India         |
| Advisor: P                                                | rof. Sachin Patkar, IIT Bombay                                                                                                                                                                                                                                                                                                         | Jul. 2017 - Jun. 2018 |
| • Acceler                                                 | ating sparse matrix solvers for performance improvements in SPICE circuit simulators                                                                                                                                                                                                                                                   |                       |
| <ul><li>Designi</li><li>Achieve</li></ul>                 | ng a stack based processor with pipelined FPU to process LU expressions parallely<br>ed upto 6x speedup using parallel hardware directives, optimizing off-chip memory access and minimizing arithn                                                                                                                                    | netic operations      |
| Person I                                                  | Re-Identification using Deep Learning                                                                                                                                                                                                                                                                                                  | Mumbai India          |
| Advisor: P                                                | rof. Subhasis Chaudhuri. IIT Bombay                                                                                                                                                                                                                                                                                                    | Jul. 2017 - Dec. 2017 |
| <ul><li>Develop</li><li>Modelle</li><li>Achieve</li></ul> | bing a Deep Learning model to spot person of interest across cameras for surveillance applications<br>and a RNN (temporal features) and CNN (spatial features) based Siamese network for video-based re-identification<br>and rank-1 accuracy 60% comparable to state-of-the-art with smaller test image sequence on iLIDS-VID dataset | in Torch              |
| Acad                                                      | emic Achievements                                                                                                                                                                                                                                                                                                                      |                       |
| 2018                                                      | Eastman Fellowship, Cornell University                                                                                                                                                                                                                                                                                                 | Ithaca, U.S.A         |
| 2017                                                      | Excellence in Teaching Assistantship, EE, IIT Bombay                                                                                                                                                                                                                                                                                   | Mumbai, India         |
| 2013                                                      | All India Rank 242, IIT Joint Entrance Exam (JEE)-Advanced, among 1.4 million examinees                                                                                                                                                                                                                                                | India                 |
| 2012                                                      | Ranked 115, KVPY Scholarship, Govt. of India , among 200,000 candidates                                                                                                                                                                                                                                                                | India                 |
| 2012                                                      | Top 1%, National Physics Olympiad                                                                                                                                                                                                                                                                                                      | Delhi, India          |
| 2011                                                      | <b>Ranked 20</b> , Regional Mathematics Olympiad (RMO) and among top 900 nationally to compete in Indian National Mathematical Olympiad (INMO)                                                                                                                                                                                         | Delhi, India          |

| National Mathematical Olympiad (INMO)                                                                                                                                                                                                                                                                                                                                                                                                                     |                               |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------|
| Extracurricular Activity                                                                                                                                                                                                                                                                                                                                                                                                                                  |                               |
| Institute Student Mentorship Programme (ISMP)                                                                                                                                                                                                                                                                                                                                                                                                             | IIT Bombay, India             |
| Student Mentor                                                                                                                                                                                                                                                                                                                                                                                                                                            | Aug. 2016 - May. 2018         |
| Selected for 2 consecutive years as part of team of 82 mentors from 368 applicant.                                                                                                                                                                                                                                                                                                                                                                        |                               |
| Mentored 24 students for smooth transition to campus life, supporting their academic & co-curricular ender                                                                                                                                                                                                                                                                                                                                                | avors                         |
| Formula Student Racing Team                                                                                                                                                                                                                                                                                                                                                                                                                               | IIT Bombay, India             |
| Design Engineer                                                                                                                                                                                                                                                                                                                                                                                                                                           | Sep. 2014 - Apr. 2016         |
| A team of 70 students building India's fastest electric racecar for Formula Student UK, an international race ca<br>Award for 2 consecutive years worth £3000 (2 out of 48 non-UK teams) for major design improvements                                                                                                                                                                                                                                    | ar design competition. Won FS |
| <ul> <li>Headed a team of 5 Engineers to design onboard data logging and real-time remote wireless data monitories</li> <li>Implemented team's first CAN protocol to improve reliability in communication and reduce wiring harness</li> <li>Implemented Electronic Differential and Regenerative Braking for the first time within the team</li> <li>Achieved 2x reduction in size and weight of PCB enclosure by optimized routing in boards</li> </ul> | ing GUI system                |