Line 34: | Line 34: | ||
* Generalization - How well a model works on data it has not been trained on. | * Generalization - How well a model works on data it has not been trained on. | ||
* [[Generative adversarial network]] (GAN) - A neural network setup for generating examples from a training distribution. | * [[Generative adversarial network]] (GAN) - A neural network setup for generating examples from a training distribution. | ||
* Generative Pretrained Transformer (GPT) - A large decoder-only transformer trained on next-word prediction. | |||
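* Gradient Descent - The optimization method used to update parameters when training a neural network: each step moves the parameters in the direction of the negative gradient of the loss. Also known as steepest descent. | * Gradient Descent - The optimization method used to update parameters when training a neural network: each step moves the parameters in the direction of the negative gradient of the loss. Also known as steepest descent. | ||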
* [[Graph neural network]] (GNN) - A type of neural network which operates on graph inputs. | * [[Graph neural network]] (GNN) - A type of neural network which operates on graph inputs. |