Sen(Qian)’s Memo

This website is Donglin Qian (Torin Sen)’s memo, especially about machine learning papers and competitive programming.

Data Augmentation

2024-09-20

2022-ICLR-[P3Mix]Who Is Your Right Mixup Partner in Positive and Unlabeled Learning

PUのcost-sensitiveのnegative assumptionを防ぐため、識別境界に近い(これは識別確率が[0.5-x, 0.5+x]に含まれているって感じの)データをmixupして学習させるという手法。面白いのは、cost sensitiveなのにPUの項の重みは固定ではなくハイパラにしているところ。

→Read more

2024-06-14

Self-supervised Data Augmentation

2020-NIPS-[FixMatch]Simplifying Semi-Supervised Learning with Consistency and Confidence

Data Augmentationして損失に導入する。ここで、強い拡張と弱い拡張の両方を考えてみる。 1. ラベルありデータの損失 2. ラベルなしデータに弱い拡張を施し予測されたラベルをhard labelにする。そのhard labelと同じラベルありデータに強い拡張を施したものの損失を計算する。弱い拡張は左右反転とか平行移動。強い拡張はモザイクとか

→Read more