connect
your data
→
define your
success criteria
→
generate a
dataset
→
setup the rl
environment
automated by castform library,
extendable by you
→
we run the rl
training loop
we handle this,
you have full observability
→
you get a fine-
tuned model