Performance & Stability
        
        How Can a Composite Reward Function Prevent Reward Hacking in Hedging Agents?
        
         
        
        
          
        
        
      
        
     
        
        A composite reward function prevents reward hacking by architecting a multi-dimensional objective that balances primary goals with risk and cost constraints.

 
  
  
  
  
 