Skip to content

Better reservoir sampling, using random Fourier features

Notifications You must be signed in to change notification settings

tbrx/rff-reservoir

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Better reservoir sampling, using random Fourier features!

This code implements the online algorithm for selecting a representative subset from streaming data, as described in this paper:

Paige, B., Sejdinovic, D., & Wood, F. (2016). Super-sampling with a Reservoir. In Proceedings of the 32nd Annual Conference on Uncertainty in Artificial Intelligence, UAI 32: 567–576.

For usage, see the example notebooks:

The code is not particularly optimized at this point. In particular, overhead from explicit looping over data structures in python means the online algorithm can be slower than a batch algorithm for moderately-sized data.

About

Better reservoir sampling, using random Fourier features

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published