Skip to content
Nilesh Chakraborty edited this page Aug 18, 2014 · 4 revisions

First Things

Firstly, please follow README.md, download all the required things, build and run your first extraction.

Before running the first extraction though, keep a backup of the base-dir directory so that the YYYYMMDD directory contains only the XXwiki-YYYYMMDD-download-complete file and NOT the XXwiki-YYYYMMDD-extraction-complete file or gzip-compressed RDF output files. This will come handy during the extraction test.

Running all unit tests

You can use the sample download/src/test/resources/download.properties file to download the li dump (you'll need to change download=en,li,ilo,bn:pages-articles-multistream.xml.bz2 to download=li:pages-articles-multistream.xml.bz2).

Modify the base-dir in both download.properties and config.properties. Set languages in config.properties to li. We're using the liwiki dump because it's small and easy to test DistRedirectsTest with.

mvn test

That should run all tests, including DistRedirectsTest.

Testing Extraction Outputs

Before running the extraction test, remove the base-dir directory and replace it completely with the fresh backup so that we don't mess up the diff's.

$ ./run-extraction-test ../config.properties ../dist-config.properties

At the end of the test, if all goes well, it should say:

====================================================================
Computing diff:
====================================================================
Outputs match!
Clone this wiki locally