connect
your data
define your
success criteria
generate a
dataset
setup the rl
environment
automated by castform library,
extendable by you
we run the rl
training loop
we handle this,
you have full observability
you get a fine-
tuned model