The hard thresholding regularised logistic regression in high dimensions with larger number of features than samples is considered. The sharp oracle inequality for the global solution is established. If the target signal is detectable, it is proven that with a high probability the estimated and true supports coincide. Starting with the KKT condition, we introduce the primal and dual active sets algorithm for fitting and also consider a sequential version of this algorithm with a warm-start strategy. Simulations and a real data analysis show that SPDAS outperforms LASSO, MCP and SCAD methods in terms of computational efficiency, estimation accuracy, support recovery and classification.
第一作者单位:[1]Wuhan Univ, Sch Math & Stat, Wuhan 430072, Peoples R China
通讯作者:
推荐引用方式(GB/T 7714):
Kang Lican,Liu Yanyan,Luo Yuan,et al.Hard Thresholding Regularised Logistic Regression: Theory and Algorithms[J].EAST ASIAN JOURNAL ON APPLIED MATHEMATICS.2022,12(1):35-52.doi:10.4208/eajam.110121.210621.
APA:
Kang, Lican,Liu, Yanyan,Luo, Yuan&Zhu, Chang.(2022).Hard Thresholding Regularised Logistic Regression: Theory and Algorithms.EAST ASIAN JOURNAL ON APPLIED MATHEMATICS,12,(1)
MLA:
Kang, Lican,et al."Hard Thresholding Regularised Logistic Regression: Theory and Algorithms".EAST ASIAN JOURNAL ON APPLIED MATHEMATICS 12..1(2022):35-52