Sen(Qian)’s Memo

This website is Donglin Qian (Torin Sen)’s memo, especially about machine learning papers and competitive programming.

PU3/4

2024-07-17

2020-PRL-[PURE]Positive-unlabeled learning for open set domain adaptation

2024-07-01

PU Curriculum Learning Self-Training Sample-Selection Self-supervised Self-Paced Learning Paper Detail Article Case-Control

2023-KDD-[RobustPU]Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction

典型的なCurriculum Learningを導入したPU。

→Read more

2024-06-26

PU Domain Shift Domain Adaptation Paper SAR Bias Detail Article UU Reweighting Noisy-Label Small Loss Trick PNU NU

2020-NIPS-[aPU]Learning from Positive and Unlabeled Data with Arbitrary Positive Shift

abs-puを開発。これはnnPUの式のmaxを絶対値に。全体的な流れは、N in U in train, N in testが同分布という仮定。まずはtrain同士でPU learningして、そこからp(y=-1|x)から比率で変換して、うまくUからN in Uを抽出する。そして、test domainにあるデータとNUかPNU Learningする。

→Read more

2024-06-09

Paper PU Bias Case-Control SCAR SAR Domain Shift Wasserstein Distance Concentration Inequality Theoretical Analysis Gradient Penalty Ada Boost Regularization Term

2023-AAAI-[GradPU]Positive-Unlabeled Learning via Gradient Penalty and Positive Upweighting

理論的に面白いのは、ワッサースタイン距離で誤差上界を評価できること。普通のPositiveと経験的Positive in Unlabeledの評価ができている。その理論的な結果から、損失関数と識別器の合成写像のリプシッツ定数が小さいほうが望ましい。また、真のPositiveの分布と、Positive in UnlabeledにDomain Shiftが生じて、矛盾するようなDomain Shiftが得られた(間違ったラベルとか)とすると、識別器はなめらかではなくなりGradientが大きくなる。 P in Uの学習とPの学習は上界から評価する限りだと、トレードオフの関係にありそう。提案手法として、Gradient PenaltyとAdaboostのような重みづけで学習促進がある。Class Priorは使わず、その代わりに学習はAdaBoostの機構による重みづけで行っている。

→Read more

2024-06-02

PU Sample-Selection Paper Bias Memorization Effect OHO^~Small Loss Trick

2019-NIPS workshop-[aaPU] Revisiting Sample Selection Approach to Positive-Unlabeled Learning- Turning Unlabeled Data into Positive rather than Negative

まずはnnPUで訓練し、ある程度信頼できるモデルにする。Noisy LabelのSmall Loss Trickを使い、そのあとから、Unlabeledの中のlossが大きいものを選んで、Positive扱いにする。しかし、Uから選んだPositive扱いのものは、nnPUでmaxを取った項の中での計算はさせない(強い過学習傾向がnnPUでさえ見られてしまう).

→Read more

2024-05-23

PU Cost-Sensitive SAR Bias Case-Control Paper

2019-ICLR-[PUSB]Learning from Positive and Unlabeled Data with a Selection Bias

→Read more

2024-05-22

PU Theoretical Analysis Bias SCAR Single-Training-Set Case-Control SAR EM-Algorithm Paper

2019-ECML PKDD-[PWE]Beyond the Selected Completely At Random Assumption for Learning from Positive and Unlabeled Data

BiasつきのPUについて、数理的に考察をし手法も提案した論文。propensity scoreという量を導入し、それを損失関数の重みに寄与させることでbiasを考慮できるとした。それをRiskの式に導入したのちに、推定の手法として2つの変数があるので(propensity scoreと本体の推定器)、EMアルゴリズムで交互に最適化をしていた。

→Read more

2024-05-21

PU Cost-Sensitive Case-Control Gradient Ascent Paper