Reinforcement Learning Training

Now, that we have defined a reward function, we are ready to start a fine-tuning job on the Augento Platform.

In the sidebar, go to Training and select the Start Training tab.

Select the model, which you previously connected to your agent in the dropdown menu.

Select Training Data

Now, in the table view, select the queries you want your fine-tuned model to be trained on. In most cases, these are queries where your running production system failed and where you want the model to improve on.

Connect Reward Function

Now connect the reward function. Click on Connect Reward Function and simply paste the URL of the endpoint from the previous step.

Training parameters

You can change the hyperparameters of the training, if you wish. However, we already provide defaults defaults that will fit most use cases.

⚠️

Changing the number of epochs will affect the price of the training.

Start Training

No you can start the fine-tuning training by clicking Start Training. You can view the submitted job in the Training Runs tab on the same page.