I went overboard analyzing the Praise Quant data this month as a practice problem/skill development exercise for Reinforcement Learning.
I am a bit out of the loop regarding TEC’s is currently at with the rewards system. I just wanted to share the report to see what peoples’ impressions are.
I like working with written language and have been working towards a similar control-system/feedback loop for a power-electronics project.
What can I say, to a man training up in ‘Ostrom-style IAD’, every problem looks like a ‘Markov Decision Process’
Im not really proposing anything at this time, just sharing the report to see what kind of impressions it gets.
Rewards Team? I helped Krisofer with some of the data parsing earlier this year.
- This is an idea or something I just need advice to proceed with.