A Bayesian scan statistic for spatial cluster detection
Document Type: Paper
Tags: Biosurveillance, Spatial Scan, Spatial Statistics
This paper develops a new Bayesian method for cluster detection, the "Bayesian spatial scan statistic," and compares this method to the standard (frequentist) scan statistic approach on the task of prospective disease surveillance.
A Bayesian spatial scan statistic
Document Type: Paper
Tags: Biosurveillance, Spatial Scan, Spatial Statistics
We propose a new Bayesian method for spatial cluster detection, the "Bayesian spatial scan statistic," and compare this method to the standard (frequentist) scan statistic approach. We demonstrate that the Bayesian statistic has several advantages over the frequentist approach, including increa...
Accelerating Exact k-means Algorithms with Geometric Reasoning
Document Type: Paper
Tags: Statistical Data Mining for Astrophysics, Cached Sufficient Statistics, Astrostatistics, Clustering, Efficient Statistical Algorithms, Kd-trees and Ball-trees, Mixture Models
A k-means tutorial. We present new algorithms for the k-means clustering problem. They use the kd-tree data structure to reduce the large number of nearest-neighbor queries issued by the traditional algorithm. Sufficient statistics are stored in the nodes of the kd-tree. Then, an analysis of th...
Accelerating Exact k-means Algorithms with Geometric Reasoning (Extended version)
Document Type: Paper
Tags: Statistical Data Mining for Astrophysics, Cached Sufficient Statistics, Clustering, Kd-trees and Ball-trees, Efficient Statistical Algorithms, Mixture Models
This is an extended version of the KDD99 paper (available here). We present new algorithms for the k-means clustering problem. They use the kd-tree data structure to reduce the large number of nearest-neighbor queries issued by the traditional algorithm. Sufficient statistics are stored in the no...
A Comparison of Statistical and Machine Learning Algorithms on the Task of Link Completion
Document Type: Paper
Tags: GDA, Testing, Link Analysis, Efficient Statistical Algorithms, Applications
Link data, consisting of a collection of subsets of entities, can be an important source of information for a variety of fields including the social sciences, biology, criminology, and business intelligence. However, these links may be incomplete, containing one or more unknown members. We consi...
A Composite Likelihood View for Multi-Label Classification
Document Type: Paper
Tags:
Given limited training samples, learning to classify multiple labels is challenging. Problem decomposition is widely used in this case, where the original problem is decomposed into a set of easier-to-learn subproblems, and predictions from subproblems are combined to make the final decision. In...
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems
Document Type: Paper
Tags:
Stability is a desirable characteristic for linear dynamical systems, but it is often ignored by algorithms that learn these systems from data. We propose a novel method for learning stable linear dynamical systems: we formulate an approximation of the problem as a convex program, start with a ...
Acquisition of Dynamic Control Knowledge for a Robotic Manipulator
Document Type: Paper
Tags: Kd-trees and Ball-trees, Memory-based Learning, Active Learning
To make efficient use of a dynamic system such as a mechanical manipulator, the robotic controller needs various models of its behaviour. I describe a method of learning in which all the experiences in the lifetime of the robot are explicitly remembered. They are stored in a manner which permits...
Active Area Search via Bayesian Quadrature
Document Type: Paper
Tags:
The selection of data collection locations is a problem that has received significant research attention, from classical design of experiments to various recent active learning algorithms. Typical objectives are to map an unknown function, optimize it, or find level sets in it. Each of these obje...
