Page history
3 March 2022
David
→Neural Networks
+109
David
→Linear Regression
+12
David
→Neural Networks
+25
David
→Neural Networks
+16
David
→Kernel Method
+66
David
→Kernel Method
+38
David
→Kernel Method
+10
David
no edit summary
+70
25 January 2021
8 December 2020
David
→Actor-critic algorithms
+503
David
→Policy Gradient Method
+528
David
→Policy Gradient Method
+936
David
→Policy Gradient Method
+7
David
→Training using Gradient Descent/Ascent
+451
David
→Training using Gradient Descent/Ascent
+114
David
→Lecture (Dec 8)
+302
David
→Deep Reinforcement Learning
+435
David
→Theorem 4.1 (Uniform conditioning implies PL* condition)
3 December 2020
David
→Policy Iteration
+1,446
David
→Optimal Policy
+621
David
→Optimal Policy
+665
David
→Optimal Policy
+408
David
→Classical RL
+487
David
→Classical RL
+657
David
→Classical RL
+1
1 December 2020
24 November 2020
19 November 2020
David
→Long Short Term Memory (LSTMs)
+667
David
→LSTMs and transformers
+675
David
→Recurrent Neural Networks (RNNs)
+2
David
→Recurrent Neural Networks (RNNs)
+112
David
no edit summary
David
→Theory of self-supervised learning
+99
17 November 2020
12 November 2020
10 November 2020
David
→Contrastive Learning
+7
David
→Contrastive Learning
+10
David
→Contrastive Learning
−7
David
→Self-supervised Learning
+2,878
David
→Which method for generalization works the best?
+61
5 November 2020
29 October 2020
David
→Domain Adaptation
+1
David
→Practical Domain Adaptation Methods
+1,380
David
→Practical Domain Adaptation Methods
+1,666
David
→Domain Adaptation
+1,167