NasBertTrainer.NasBertOptions Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
public class NasBertTrainer.NasBertOptions : Microsoft.ML.TorchSharp.TorchSharpBaseTrainer.Options
type NasBertTrainer.NasBertOptions = class
inherit TorchSharpBaseTrainer.Options
Public Class NasBertTrainer.NasBertOptions
Inherits TorchSharpBaseTrainer.Options
- Inheritance
- Derived
Constructors
NasBertTrainer.NasBertOptions() |
Fields
ActivationDropout |
Dropout rate after activation functions in FFN layers. Should be within [0, 1). |
AdamBetas |
Betas for Adam optimizer. |
AdamEps |
Epsilon for Adam optimizer. |
AttentionDropout |
Dropout rate for attention weights. Should be within [0, 1). |
BatchSize |
Number of samples to use for mini-batch training. (Inherited from TorchSharpBaseTrainer.Options) |
ClipNorm |
The clipping threshold of gradients. Should be within [0, +Inf). 0 means not to clip norm. |
Dropout |
Dropout rate for general situations. Should be within [0, 1). |
DynamicDropout |
Whether to use dynamic dropout. |
EncoderNormalizeBefore |
Whether to apply layer normalization before each encoder block. |
FinalLearningRateRatio |
The final learning rate for polynomial decay scheduler. (Inherited from TorchSharpBaseTrainer.Options) |
FreezeEncoder |
Whether to freeze encoder parameters. |
FreezeTransfer |
Whether to freeze transfer module parameters. |
LabelColumnName |
The label column name. (Inherited from TorchSharpBaseTrainer.Options) |
LayerNormTraining |
Whether to train layer norm parameters. |
LearningRate |
Learning rate for the first N epochs; all epochs >N using LR_N. Note: this may be interpreted differently depending on the scheduler. |
MaxEpoch |
Stop training when reaching this number of epochs. (Inherited from TorchSharpBaseTrainer.Options) |
PoolerDropout |
Dropout rate in the masked language model pooler layers. Should be within [0, 1). |
PredictionColumnName |
The Prediction column name. (Inherited from TorchSharpBaseTrainer.Options) |
ScoreColumnName |
The Score column name. (Inherited from TorchSharpBaseTrainer.Options) |
Sentence1ColumnName |
The first sentence column. |
Sentence2ColumnName |
The second sentence column. |
StartLearningRateRatio |
The start learning rate for polynomial decay scheduler. (Inherited from TorchSharpBaseTrainer.Options) |
TaskType |
Task type, which is related to the model head. |
ValidationSet |
The validation set used while training to improve model quality. (Inherited from TorchSharpBaseTrainer.Options) |
WarmupRatio |
Proportion of warmup steps for polynomial decay scheduler. |
WeightDecay |
Coefficiency of weight decay. Should be within [0, +Inf). (Inherited from TorchSharpBaseTrainer.Options) |