I am building a model where firms have to set prices and make production decisions. Prices are continuous and so are the decision variables. (inventory, last sales, prices...).
What reinforcement learning method can I use that maps continuous to continuous ? Which python packages are there? If there are no python packages, I could write a wrapper.