Algorithms for Deciding the Safety of States in Fully Observable Non-deterministic Problems
Suppose you have a learned policy for a Fully Observable Non-deterministic (FOND) problem. How can you be sure that it is safe? One approach is via fault a...