Better reservoir sampling, using random Fourier features!

This code implements the online algorithm for selecting a representative subset from streaming data, as described in this paper:

Paige, B., Sejdinovic, D., & Wood, F. (2016). Super-sampling with a Reservoir. In Proceedings of the 32nd Annual Conference on Uncertainty in Artificial Intelligence, UAI 32: 567–576.

For usage, see the example notebooks:

The code is not particularly optimized at this point. In particular, overhead from explicit looping over data structures in python means the online algorithm can be slower than a batch algorithm for moderately-sized data.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
ext		ext
notebooks		notebooks
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Better reservoir sampling, using random Fourier features!

About

Releases

Packages

Languages

tbrx/rff-reservoir

Folders and files

Latest commit

History

Repository files navigation

Better reservoir sampling, using random Fourier features!

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages