[YOUR NAME]

Bio

I am a first-year PhD student at Carnegie Mellon University. My research focuses on developing principled foundations for learning and inference in complex systems, with applications to scientific discovery. My current interests include:

Causality

How can we combine flexible machine learning models with principled statistical frameworks to uncover causal mechanisms from complex observational data, under modern challenges like confounding, distribution shift, and limited experimentation?

Foundations of Learning

What shared principles govern how neural networks and biological systems alike extract structure, form representations, and generalize from experience, and can we leverage these insights to understand and improve modern machine learning paradigms?

Scientific Applications

How can we harness generative models and modern AI to learn the structure of complex scientific data, simulate unseen conditions, and accelerate discovery, while ensuring the conclusions remain reliable and interpretable?

News

Mar 2026 Our paper clarifying the relationship between double machine learning and influence functions is on arXiv.

Dec 2025 Our paper on feature importance ranking is accepted to TMLR!

Publications

ALL / BY AREA / BY TYPE

Selected
Preprint
Conference
Journal

Deep Memory Unrolled Networks for Solving Imaging Linear Inverse Problems Abstract PDF Slides

Yuxi Chen, Xi Chen, Shirin Jalali, Arian Maleki

Sampling Theory and Applications (SampTA), 2025 (Oral)

Unrolled networks have emerged as one of the most successful methods in imaging applications. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network's overall performance. These decisions include selecting the optimization algorithm to unroll, defining the loss function, deciding on the structure of residual connections, and determining the number of convolutional layers, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train the neural network. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive numerical study to discover the optimal design choices. We anticipate that this study will help scientists and engineers design unrolled networks for their applications and diagnose problems within their networks efficiently.

Top-k Feature Importance Ranking Abstract PDF

Yuxi Chen, Tiffany Tang, Genevera Allen

Transactions on Machine Learning Research (TMLR), 2025

Accurate ranking of important features is a fundamental challenge in interpretable machine learning with critical applications in scientific discovery and decision-making. Unlike feature selection and feature importance, the specific problem of ranking important features has received considerably less attention. We introduce RAMPART (Ranked Attributions with MiniPatches And Recursive Trimming), a framework that utilizes any existing feature importance measure in a novel algorithm specifically tailored for ranking the top-k features. Our approach combines an adaptive sequential halving strategy that progressively focuses computational resources on promising features with an efficient ensembling technique using both observation and feature subsampling. Unlike existing methods that convert importance scores to ranks as post-processing, our framework explicitly optimizes for ranking accuracy. We provide theoretical guarantees showing that RAMPART achieves the correct top-k ranking with high probability under mild conditions, and demonstrate through extensive simulation studies that RAMPART consistently outperforms popular feature importance methods, concluding with two high-dimensional genomics case studies.

Comprehensive Examination of Unrolled Networks for Solving Linear Inverse Problems Abstract PDF

Yuxi Chen, Xi Chen, Arian Maleki, Shirin Jalali

Entropy (Special Issue on Advances in Computational Imaging), 2025

Unrolled networks have become prevalent in various computer vision and imaging tasks. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network's overall performance. These decisions include selecting the optimization algorithm, defining the loss function, and determining the deep architecture, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train, fine-tune the neural network, and optimize its performance. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive ablation study to discuss the impact of each of the choices involved in designing unrolled networks and present practical recommendations based on our findings. We anticipate that this study will help scientists and engineers to design unrolled networks for their applications and diagnose problems within their networks efficiently.

Joint Optimization of Multiple Resources for Distributed Service Deployment in Satellite Edge Computing Networks Abstract PDF

Jiachen Sun, Xu Chen, Zhen Li, Jiawei Wang, Yuxi Chen

IEEE Internet of Things Journal, 2024

With the emergence of mobile edge applications and the demand for access-as-a-service, satellite mobile edge computing stands out as a disruptive technology for delivering low-latency edge service. In this article, we focus on service deployment to the edge satellites for terrestrial users, which is a key enabling technology in satellite mobile edge computing and will replace traditional centralized cloud computing. Most existing works on service deployment consider a centralized nonconvex optimization problem with high computational overhead. However, in practice, it is difficult for a single satellite to solve computationally expensive network optimization problems. To this end, we propose a distributed optimization model based on the alternating direction method of multipliers (ADMMs), which can relieve the computational burden by leveraging collaborative calculations among multiple satellites. Our proposed model minimizes the total delay of service deployment for terrestrial users by formulating a joint optimization problem that involves deployment decisions, CPU resource decisions, transmission decisions, and caching decisions. Furthermore, we propose a novel approximation method that transforms the nonconvex optimization problem to a convex one to make the joint optimization problem solvable in polynomial time. Finally, we conduct experiments using scaled global population data and show that the proposed distributed model outperforms the baselines.

Causal Inference
Generative Modeling
Statistical Learning
Scientific Applications

On the Equivalence between Neyman Orthogonality and Pathwise Differentiability Abstract PDF

Yuxi Chen, Edward H. Kennedy, Sivaraman Balakrishnan

arXiv preprint

It has been frequently observed that Neyman orthogonality, the central device underlying double/debiased machine learning (Chernozhukov et al., 2018), and pathwise differentiability, a cornerstone concept from semiparametric theory, often lead to the same debiased estimators in practice. Despite the widespread adoption of both ideas, the precise nature of this equivalence has remained elusive, with the two concepts having been developed in largely separate traditions. In this work, we revisit the semiparametric framework of van der Laan and Robins (2003) and identify an implicit regularity assumption on the relationship between target and nuisance parameters---a local product structure---that allows us to establish a formal equivalence between Neyman orthogonality and pathwise differentiability. We demonstrate that the two directions of this equivalence impose fundamentally different structural requirements, and illustrate the theory through a concrete example of estimating the average treatment effect. This helps clarify the relationship between these two foundational frameworks and provides a useful reference for practitioners working at their intersection.

Top-k Feature Importance Ranking Abstract PDF

Yuxi Chen, Tiffany Tang, Genevera Allen

Transactions on Machine Learning Research (TMLR), 2025

Accurate ranking of important features is a fundamental challenge in interpretable machine learning with critical applications in scientific discovery and decision-making. Unlike feature selection and feature importance, the specific problem of ranking important features has received considerably less attention. We introduce RAMPART (Ranked Attributions with MiniPatches And Recursive Trimming), a framework that utilizes any existing feature importance measure in a novel algorithm specifically tailored for ranking the top-k features. Our approach combines an adaptive sequential halving strategy that progressively focuses computational resources on promising features with an efficient ensembling technique using both observation and feature subsampling. Unlike existing methods that convert importance scores to ranks as post-processing, our framework explicitly optimizes for ranking accuracy. We provide theoretical guarantees showing that RAMPART achieves the correct top-k ranking with high probability under mild conditions, and demonstrate through extensive simulation studies that RAMPART consistently outperforms popular feature importance methods, concluding with two high-dimensional genomics case studies.

2026

On the Equivalence between Neyman Orthogonality and Pathwise Differentiability Abstract PDF

Yuxi Chen, Edward H. Kennedy, Sivaraman Balakrishnan

arXiv preprint

It has been frequently observed that Neyman orthogonality, the central device underlying double/debiased machine learning (Chernozhukov et al., 2018), and pathwise differentiability, a cornerstone concept from semiparametric theory, often lead to the same debiased estimators in practice. Despite the widespread adoption of both ideas, the precise nature of this equivalence has remained elusive, with the two concepts having been developed in largely separate traditions. In this work, we revisit the semiparametric framework of van der Laan and Robins (2003) and identify an implicit regularity assumption on the relationship between target and nuisance parameters---a local product structure---that allows us to establish a formal equivalence between Neyman orthogonality and pathwise differentiability. We demonstrate that the two directions of this equivalence impose fundamentally different structural requirements, and illustrate the theory through a concrete example of estimating the average treatment effect. This helps clarify the relationship between these two foundational frameworks and provides a useful reference for practitioners working at their intersection.

2025

Top-k Feature Importance Ranking Abstract PDF

Yuxi Chen, Tiffany Tang, Genevera Allen

Transactions on Machine Learning Research (TMLR), 2025

Accurate ranking of important features is a fundamental challenge in interpretable machine learning with critical applications in scientific discovery and decision-making. Unlike feature selection and feature importance, the specific problem of ranking important features has received considerably less attention. We introduce RAMPART (Ranked Attributions with MiniPatches And Recursive Trimming), a framework that utilizes any existing feature importance measure in a novel algorithm specifically tailored for ranking the top-k features. Our approach combines an adaptive sequential halving strategy that progressively focuses computational resources on promising features with an efficient ensembling technique using both observation and feature subsampling. Unlike existing methods that convert importance scores to ranks as post-processing, our framework explicitly optimizes for ranking accuracy. We provide theoretical guarantees showing that RAMPART achieves the correct top-k ranking with high probability under mild conditions, and demonstrate through extensive simulation studies that RAMPART consistently outperforms popular feature importance methods, concluding with two high-dimensional genomics case studies.

Deep Memory Unrolled Networks for Solving Imaging Linear Inverse Problems Abstract PDF Slides

Yuxi Chen, Xi Chen, Shirin Jalali, Arian Maleki

Sampling Theory and Applications (SampTA), 2025 (Oral)

Unrolled networks have emerged as one of the most successful methods in imaging applications. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network's overall performance. These decisions include selecting the optimization algorithm to unroll, defining the loss function, deciding on the structure of residual connections, and determining the number of convolutional layers, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train the neural network. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive numerical study to discover the optimal design choices. We anticipate that this study will help scientists and engineers design unrolled networks for their applications and diagnose problems within their networks efficiently.

Comprehensive Examination of Unrolled Networks for Solving Linear Inverse Problems Abstract PDF

Yuxi Chen, Xi Chen, Arian Maleki, Shirin Jalali

Entropy (Special Issue on Advances in Computational Imaging), 2025

Unrolled networks have become prevalent in various computer vision and imaging tasks. Although they have demonstrated remarkable efficacy in solving specific computer vision and computational imaging tasks, their adaptation to other applications presents considerable challenges. This is primarily due to the multitude of design decisions that practitioners working on new applications must navigate, each potentially affecting the network's overall performance. These decisions include selecting the optimization algorithm, defining the loss function, and determining the deep architecture, among others. Compounding the issue, evaluating each design choice requires time-consuming simulations to train, fine-tune the neural network, and optimize its performance. As a result, the process of exploring multiple options and identifying the optimal configuration becomes time-consuming and computationally demanding. The main objectives of this paper are (1) to unify some ideas and methodologies used in unrolled networks to reduce the number of design choices a user has to make, and (2) to report a comprehensive ablation study to discuss the impact of each of the choices involved in designing unrolled networks and present practical recommendations based on our findings. We anticipate that this study will help scientists and engineers to design unrolled networks for their applications and diagnose problems within their networks efficiently.

2024

Joint Optimization of Multiple Resources for Distributed Service Deployment in Satellite Edge Computing Networks Abstract PDF

Jiachen Sun, Xu Chen, Zhen Li, Jiawei Wang, Yuxi Chen

IEEE Internet of Things Journal, 2024

With the emergence of mobile edge applications and the demand for access-as-a-service, satellite mobile edge computing stands out as a disruptive technology for delivering low-latency edge service. In this article, we focus on service deployment to the edge satellites for terrestrial users, which is a key enabling technology in satellite mobile edge computing and will replace traditional centralized cloud computing. Most existing works on service deployment consider a centralized nonconvex optimization problem with high computational overhead. However, in practice, it is difficult for a single satellite to solve computationally expensive network optimization problems. To this end, we propose a distributed optimization model based on the alternating direction method of multipliers (ADMMs), which can relieve the computational burden by leveraging collaborative calculations among multiple satellites. Our proposed model minimizes the total delay of service deployment for terrestrial users by formulating a joint optimization problem that involves deployment decisions, CPU resource decisions, transmission decisions, and caching decisions. Furthermore, we propose a novel approximation method that transforms the nonconvex optimization problem to a convex one to make the joint optimization problem solvable in polynomial time. Finally, we conduct experiments using scaled global population data and show that the proposed distributed model outperforms the baselines.

Teaching

Carnegie Mellon University Teaching Assistant

46-932 Simulation Methods in Option Pricing

Spring 2026

This course initially presents standard topics in simulation including random variable generation, statistical analysis of simulation output and variance reduction methods including antithetic variables, control variables, importance sampling, conditional Monte Carlo, stratification and martingale control variables.

46-929 Financial Time Series Analysis

This course is an introduction to financial time series analysis. We will cover basic time series models (AR, MA, ARMA and ARIMA) and their use in financial applications, including forecasting and the development of quantitative trading strategies.

36-401 Modern Regression

Fall 2025

This course is an introduction to applied data analysis using linear regression modeling. We will derive properties about those models, apply and examine various models for real datasets, assess the validity of modeling assumptions, and determine what conclusions we can make (if any) from those models.

Eric Chen

Bio

News

Publications

Teaching