• 中文核心期刊要目总览
  • 中国科技核心期刊
  • 中国科学引文数据库(CSCD)
  • 中国科技论文与引文数据库(CSTPCD)
  • 中国学术期刊文摘数据库(CSAD)
  • 中国学术期刊(网络版)(CNKI)
  • 中文科技期刊数据库
  • 万方数据知识服务平台
  • 中国超星期刊域出版平台
  • 国家科技学术期刊开放平台
  • 荷兰文摘与引文数据库(SCOPUS)
  • 日本科学技术振兴机构数据库(JST)

LightAD: 基于自适应采样的自动去偏框架加速方法

LightAD: accelerating AutoDebias with adaptive sampling

  • 摘要: 在推荐系统中,由于数据是从用户行为中收集而来而不是通过合理的实验得到,因此偏差问题是普遍存在的。自动去偏框架(AutoDebias)通过采用元学习寻找适当的去偏置配置 (即伪标签和置信权重) ,已被证明是应对各种偏差的一种通用和有效的解决方案。然而,为每个用户-物品对设置伪标签和权重是一个极度耗时的过程。因此,自动去偏框架面临巨大的计算成本,这使其在实际应用中的适用性较差。虽然使用带有均匀采样器的随机梯度下降可以加速模型训练,但这会显著恶化模型的收敛性和稳定性。为了解决这个问题,我们提出了LightAutoDebias (简称LightAD) ,它为自动去偏框架配备了一种专门的权重采样策略。该采样器可以自适应地和动态地获取信息量更大的训练样本,这带来的收敛性和稳定性比标准的均匀采样器更好。在三个基准数据集上的广泛实验证明,我们的LightAD可以加速自动去偏框架的训练达几个数量级,同时保持几乎相同的精度。

     

    Abstract: In recommendation systems, bias is ubiquitous because the data are collected from user behaviors rather than from reasonable experiments. AutoDebias, which resorts to metalearning to find appropriate debiasing configurations, i.e., pseudolabels and confidence weights for all user-item pairs, has been demonstrated to be a generic and effective solution for tackling various biases. Nevertheless, setting pseudolabels and weights for every user-item pair can be a time-consuming process. Therefore, AutoDebias suffers from an enormous computational cost, making it less applicable to real cases. Although stochastic gradient descent with a uniform sampler can be applied to accelerate training, this approach significantly deteriorates model convergence and stability. To overcome this problem, we propose LightAutoDebias (short as LightAD), which equips AutoDebias with a specialized importance sampling strategy. The sampler can adaptively and dynamically draw informative training instances, which results in better convergence and stability than does the standard uniform sampler. Several experiments on three benchmark datasets validate that our LightAD accelerates AutoDebias by several magnitudes while maintaining almost equal accuracy.

     

/

返回文章
返回