Eric Chen

PhD Student, Carnegie Mellon University

ericc3 [AT] andrew.cmu.edu

Bio

I am a first-year PhD student at Carnegie Mellon University. I build modern, principled frameworks that enables reliable inference and decision-making from complex data. My interests include:

AI for Science
How can we integrate modern AI into existing research pipelines to accelerate scientific discovery from complex, high-dimensional data while ensuring trustworthy scientific conclusions?
Causal Inference
How can we leverage machine learning tools to estimate causal effects while preserving validity and develop robust methods under modern data challenges like confounding and selection bias?
Data-Driven Decision-Making
What are the fundamental limits of learning in interactive environments, and how can we design provably efficient algorithms with theoretical guarantees that scale to real-world systems?
Explainable AI
How can we develop explanation methods that not only describe but reliably predict model behavior, grounded in rigorous statistical methodology and large-scale empirical validation?

News

Sep 2025 Started my PhD at Carnegie Mellon University!

Publications

BY TYPE / BY AREA
  • Selected
  • Preprint
  • Conference
  • Journal

Top-k Feature Importance Ranking PDF

Yuxi Chen, Tiffany Tang, Genevera Allen

arXiv preprint

Deep Memory Unrolled Networks for Solving Imaging Linear Inverse Problems Abstract PDF Slides

Yuxi Chen, Xi Chen, Shirin Jalali, Arian Maleki

Sampling Theory and Applications (SampTA), 2025, (Oral)

Unrolled networks have emerged as one of the most successful methods in imaging applications. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network's overall performance. These decisions include selecting the optimization algorithm to unroll, defining the loss function, deciding on the structure of residual connections, and determining the number of convolutional layers, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train the neural network. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive numerical study to discover the optimal design choices. We anticipate that this study will help scientists and engineers design unrolled networks for their applications and diagnose problems within their networks efficiently.

Comprehensive Examination of Unrolled Networks for Solving Linear Inverse Problems Abstract PDF

Yuxi Chen, Xi Chen, Arian Maleki, Shirin Jalali

Entropy (Special Issue on Advances in Computational Imaging), 2025

Unrolled networks have become prevalent in various computer vision and imaging tasks. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network’s overall performance. These decisions include selecting the optimization algorithm, defining the loss function, and determining the deep architecture, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train, fine-tune the neural network, and optimize its performance. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive ablation study to discuss the impact of each of the choices involved in designing unrolled networks and present practical recommendations based on our findings. We anticipate that this study will help scientists and engineers to design unrolled networks for their applications and diagnose problems within their networks efficiently.

Joint Optimization of Multiple Resources for Distributed Service Deployment in Satellite Edge Computing Networks Abstract PDF

Jiachen Sun, Xu Chen, Zhen Li, Jiawei Wang, Yuxi Chen

IEEE Internet of Things Journal, 2024

With the emergence of mobile edge applications and the demand for access-as-a-service, satellite mobile edge computing stands out as a disruptive technology for delivering low-latency edge service. In this article, we focus on service deployment to the edge satellites for terrestrial users, which is a key enabling technology in satellite mobile edge computing and will replace traditional centralized cloud computing. Most existing works on service deployment consider a centralized nonconvex optimization problem with high computational overhead. However, in practice, it is difficult for a single satellite to solve computationally expensive network optimization problems. To this end, we propose a distributed optimization model based on the alternating direction method of multipliers (ADMMs), which can relieve the computational burden by leveraging collaborative calculations among multiple satellites. Our proposed model minimizes the total delay of service deployment for terrestrial users by formulating a joint optimization problem that involves deployment decisions, CPU resource decisions, transmission decisions, and caching decisions. Furthermore, we propose a novel approximation method that transforms the nonconvex optimization problem to a convex one to make the joint optimization problem solvable in polynomial time. Finally, we conduct experiments using scaled global population data and show that the proposed distributed model outperforms the baselines.
  • AI for Science
  • Causal Inference
  • Decision-Making
  • Explainable AI

Top-k Feature Importance Ranking PDF

Yuxi Chen, Tiffany Tang, Genevera Allen

arXiv preprint

Teaching

Carnegie Mellon University
36-401 Modern Regression
Teaching Assistant Fall 2025
This course is an introduction to applied data analysis using linear regression modeling. We will derive properties about those models, apply and examine various models for real datasets, assess the validity of modeling assumptions, and determine what conclusions we can make (if any) from those models.