Cyber physical systems (CPS) are changing the way machine tools function and operate. As the CAD-CAM-CNC tool chain gains intelligence the boundaries of the elements of the tool chain become blurred and new features, based on advancements in artificial intelligence can be integrated. The main task of the CAD-CAM-CNC chain is to generate the cutter trajectories for the manufacturing operation. Driven by sustainability and the need for capacity, the need arises to optimize the paths through this tool chain. In this paper a concept for path optimization with reinforcement learning is proposed, with focus on the reward function, specific to tool path optimization via the channel method.