Large Multi‐Response Linear Regression Estimation Based on Low‐Rank Pre‐Smoothing

Tian, Xinle and Gibberd, Alex and Nunes, Matthew A. and Roy, Sandipan (2026) Large Multi‐Response Linear Regression Estimation Based on Low‐Rank Pre‐Smoothing. Statistical Analysis and Data Mining, 19 (2): e70072. ISSN 1932-1864

[thumbnail of sam.70072.pdf]
Text (sam.70072.pdf)
sam.70072.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB)

Abstract

Pre‐smoothing is a technique aimed at increasing the signal‐to‐noise ratio in data to improve subsequent estimation and model selection in regression problems. However, pre‐smoothing has thus far been limited to the univariate response regression setting. However, there are many scientific applications in which interest lies in multi‐response regression problems, particularly when the number of responses is large. Motivated by this setting, this article proposes a technique for data pre‐smoothing based on low‐rank approximation. We establish theoretical results on the performance of the proposed methodology, which show that in this large‐response setting, the proposed technique outperforms ordinary least squares estimation with the mean squared error criterion, whilst being computationally more efficient than alternative approaches such as reduced rank regression. We quantify our estimator's benefit empirically in a number of simulated experiments. We also demonstrate our proposed low‐rank pre‐smoothing technique on real data arising from the environmental and biological sciences.

Item Type:
Journal Article
Journal or Publication Title:
Statistical Analysis and Data Mining
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/2600/2603
Subjects:
?? low‐rank approximationpre‐smoothingparameter estimationmulti‐response modelpredictionanalysisinformation systemscomputer science applications ??
ID Code:
236735
Deposited By:
Deposited On:
22 Apr 2026 13:50
Refereed?:
Yes
Published?:
Published
Last Modified:
23 Apr 2026 02:10