The following datasets were used in Wong and Moore (2003), Bayesian Network Anomaly Pattern Detection for Disease Outbreaks ICML 2003.
They are stored in this form on this page in order to allow other researchers to run experiments on the same datasets with identical preprocessing, including discretization levels of real-valued attributes and compensation for missing values.
wsare3data.zip (10.7 megs), Zipfile containing 100 datasets described in the paper. It also contains instructions.txt with detailed documentation on the datasets.
Please feel welcome to contact Weng-Keen Wong with questions or comments.