5,321
edits
(→L) |
(→G) |
||
Line 36: | Line 36: | ||
* Generalization - How well a model works on data it has not been trained on. | * Generalization - How well a model works on data it has not been trained on. | ||
* [[Generative adversarial network]] (GAN) - A neural network setup for generating examples from a training distribution. | * [[Generative adversarial network]] (GAN) - A neural network setup for generating examples from a training distribution. | ||
* Generative Pretrained Transformer (GPT) - A large decoder-only transformer trained on next-word prediction. | * Generative Pretrained Transformer (GPT) - A large decoder-only transformer trained on next-word prediction. GPT-2, GPT-3 refers specific models owned by OpenAI. | ||
* Gradient Descent - The operation used to update parameters when optimizing neural network. Also known as direction of steepest descent. | * Gradient Descent - The operation used to update parameters when optimizing neural network. Also known as direction of steepest descent. | ||
* [[Graph neural network]] (GNN) - A type of neural network which operates on graph inputs. | * [[Graph neural network]] (GNN) - A type of neural network which operates on graph inputs. |