# **Reese Kuper** reesekuper.com | reese.kuper@gmail.com

## EDUCATION

| Masters – University of Illinois at Urbana-Champaign (3.97/4.00)<br>Department of Electrical and Computer Engineering<br>Focus: Computer Architecture                                                                     |                                                                                                                                                                                                                                      | 2021 - 2023                                                                                          |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------|
| <b>Bachelors</b> – University of Wisconsin-<br>Degrees in Computer Engineering and G                                                                                                                                      | · · · · · · · · · · · · · · · · · · ·                                                                                                                                                                                                | 2017 - 2021                                                                                          |
| Relevant Coursework <ul> <li>Parallel Computer Architecture</li> <li>Computer Microarchitecture</li> <li>Artificial Intelligence</li> </ul>                                                                               | <ul><li>Advanced Computer Architecture</li><li>Digital System Design &amp; Synthesis</li><li>Algorithms</li></ul>                                                                                                                    | <ul><li>Memory and Storage Systems</li><li>Operating Systems</li><li>System-on-Chip Design</li></ul> |
| SKILLS                                                                                                                                                                                                                    |                                                                                                                                                                                                                                      |                                                                                                      |
| Hardware Description and Programm<br>Development and Software Tools                                                                                                                                                       | ing Languages System Verilog, C/C++, P<br>Vivado, Vivado HLS, Moo                                                                                                                                                                    | ython, Bash Scripting, MySQL<br>delSim, Quartus, Vim, Git                                            |
| EXPERIENCE                                                                                                                                                                                                                |                                                                                                                                                                                                                                      |                                                                                                      |
| Arm<br>Graduate Hardware Engineer – Syster<br>• Improved and automated portions o                                                                                                                                         | ns Interconnect<br>f interconnect model simulation flow for systen                                                                                                                                                                   | 2024 - Present<br><i>Austin, TX</i><br>n analysis work.                                              |
| <ul><li>Explored use cases for DSA to take a</li><li>Submitted patents for improving me</li></ul>                                                                                                                         | Accelerators<br>DSA – an on-chip accelerator found on Sapphi<br>advantage of cache pollution mitigation and hig<br>mory deduplication techniques using DSA.<br>r Intel's on-chip accelerators at ISCA 2023.                          |                                                                                                      |
| <ul><li>Built a functional cache simulator to</li><li>Analyzed the characteristics of real</li></ul>                                                                                                                      | Systems Laboratory<br>L1 cache in a CXL device for DLRM offloading.<br>evaluate both hit rates and cache occupancy ra<br>DLRM data for locality patterns to aid in the de<br>via Ramulator to obtain bandwidth, hit rate, an         | sign of the memory system.                                                                           |
| <ul> <li>Coded macro for ease of adding</li> </ul>                                                                                                                                                                        | CPU memory system testbench.                                                                                                                                                                                                         | pench.                                                                                               |
| <ul><li>Fixed UVM register definition auto-</li><li>Programmed module for modeling t</li></ul>                                                                                                                            | ns Interconnect Verification<br>nalyze the use of all plusargs within the UVM to<br>generation for more flexible RAL models.<br>ransactions between a master device to intercor<br>SB priority arbiters using System Verilog assert: | nnect return nodes in SystemC.                                                                       |
| Qualcomm Inc.<br>Software Engineering Intern – Linux H<br>• Improved kernel ION allocation mer<br>• Analyzed the efficiency of IOVA's us<br>• Created internal Python tool for par<br>• Worked towards shifting mman allo | nory speeds by around 10%.<br>se of caching and compared it with MMAP's gap<br>sing Linux RAM dump binaries.                                                                                                                         | 2019<br><i>San Diego, CA</i><br>p searching RBTree.                                                  |

• Worked towards shifting mmap allocations to use the mempool API.

### PROJECTS

#### **Relevant Projects and Research**

- Investigated finer-grained GPU scheduling for improved GPU utilization in machine learning workloads.
- Self-balancing Segway written in Verilog for a DE0-Nano FPGA board.
- Synthesized a 5-stage pipelined CPU and a dataflow processor written in Verilog.
- Implemented prefetcher designs and age matrix issueing schemes for BOOM RISC-V processor on an FPGA.

### PUBLICATIONS, PATENTS, AND CONFERENCES

- A Quantitative Analysis and Guidelines of Data Streaming Accelerator in Modern Intel Xeon Scalable Processors R. Kuper, I. Jeong, Y. Yuan, R. Wang, N. Ranganathan, N. Rao, J. Hu, S. Kumar, P. Lantz, N. S. Kim International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2024
- Efficiently Merging Non-Identical Pages in Kernel Same-Page Merging (KSM) for Efficient and Improved Memory Deduplication and Security
   R. Kuper, Y. Yuan, R. Wang
   US Patent App 18/369,090 (Patent), 2024
- Method and Appratus for Batching Pages for a Data Movement Accelerator R. Kuper, Y. Yuan, R. Wang US Patent App 18/477,628 (Patent), 2024
- Demystifying CXL Memory with Genuine CXL-Ready Systems and Devices Y. Sun, Y. Yuan, Z. Yu, R. Kuper, C. Song, J. Huang, H. Ji, S. Agarwal, J. Lou, I. Jeong, R. Wang, J. H. Ahn, T. Xu, N. S. Kim International Symposium on Microarchitecture (MICRO), 2023
- On-chip Accelerators in 4th Gen Intel® Xeon® Scalable Processors: Features, Performance, Use Cases, and Future!
   R. Kuper, I. Jeong, Y. Yuan, J. Hu, R. Wang, N. Ranganathan, N. S. Kim
   [TUTORIAL] International Symposium on Computer Architecture (ISCA), 2023
- STYX: Exploiting SmartNIC Capability to Reduce Datacenter Memory Tax H. Ji, Y. Sun, M. Mansi, Y. Yuan, J. Huang, **R. Kuper**, M. Swift, N. S. Kim The USENIX Annual Technical Conference (ATC), 2023
- Improving GPU Utilization in ML Workloads Through Finer-Grained Synchronization R. Kuper, S. Pati, M. Sinclair Young Architect Workshop (YArch), 2021