TY - JOUR
T1 - HydaLearn
T2 - Highly Dynamic Task Weighting for Multitask Learning with Auxiliary Tasks
AU - Verboven, Sam
AU - Chaudhary, Muhammad Hafeez
AU - Berrevoets, Jeroen
AU - Ginis, Vincent
AU - Verbeke, Wouter
N1 - Publisher Copyright: © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2023/3
Y1 - 2023/3
N2 - Multitask learning (MTL) can improve performance on one task by sharing representations with one or more related auxiliary tasks. Usually, MTL networks are trained on a composite loss function formed by a fixed weighted combination of separate task losses. In practice, however, static loss weights lead to poor results for two reasons. First, the relevance of the auxiliary tasks gradually drifts throughout the learning process. Second, for minibatch-based optimization, the optimal task weights vary significantly from one update to the next depending on the minibatch sample composition. Here, we introduce HydaLearn, an intelligent weighting algorithm that connects the main-task gain to the individual task gradients, to inform dynamic loss weighting at the minibatch level, addressing the two above shortcomings. We demonstrate significant performance increases on synthetic data and two real-world data sets.
AB - Multitask learning (MTL) can improve performance on one task by sharing representations with one or more related auxiliary tasks. Usually, MTL networks are trained on a composite loss function formed by a fixed weighted combination of separate task losses. In practice, however, static loss weights lead to poor results for two reasons. First, the relevance of the auxiliary tasks gradually drifts throughout the learning process. Second, for minibatch-based optimization, the optimal task weights vary significantly from one update to the next depending on the minibatch sample composition. Here, we introduce HydaLearn, an intelligent weighting algorithm that connects the main-task gain to the individual task gradients, to inform dynamic loss weighting at the minibatch level, addressing the two above shortcomings. We demonstrate significant performance increases on synthetic data and two real-world data sets.
KW - Adaptive multitask learning
KW - Default prediction
KW - Machine learning
KW - Mortality prediction
KW - Neural networks
UR - http://www.scopus.com/inward/record.url?scp=85133352554&partnerID=8YFLogxK
U2 - 10.1007/s10489-022-03695-x
DO - 10.1007/s10489-022-03695-x
M3 - Article
AN - SCOPUS:85133352554
SN - 0924-669X
VL - 53
SP - 5808
EP - 5822
JO - Applied Intelligence
JF - Applied Intelligence
IS - 5
ER -