diff options
author | Bryan Newbold <bnewbold@archive.org> | 2018-05-08 10:06:14 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2018-05-08 10:06:20 -0700 |
commit | 18a55d37a87d4391bd8161201c523dd7d7f0f1e7 (patch) | |
tree | 86db4c84cf4fd0dde5ea9508617344018e640104 /pig/tests/files/papers_url_doi.cdx | |
parent | 1831a3b4495aee275e4b4b187fa545eba75eb87b (diff) | |
download | sandcrawler-18a55d37a87d4391bd8161201c523dd7d7f0f1e7.tar.gz sandcrawler-18a55d37a87d4391bd8161201c523dd7d7f0f1e7.zip |
fix tests post-DISTINCT
Confirms it's working!
Diffstat (limited to 'pig/tests/files/papers_url_doi.cdx')
-rw-r--r-- | pig/tests/files/papers_url_doi.cdx | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/pig/tests/files/papers_url_doi.cdx b/pig/tests/files/papers_url_doi.cdx index 1ad5792..ee90fb1 100644 --- a/pig/tests/files/papers_url_doi.cdx +++ b/pig/tests/files/papers_url_doi.cdx @@ -3,5 +3,5 @@ # should match 2: -org,ametsoc,journals)/doi/pdf/10.1175/2008BAMS2370.1 20170706005950 http://mit.edu/file.pdf application/pdf 200 MQHD36X5MNZPWFNMD5LFOYZSFGCHUN3V - - 123 456 CRAWL/CRAWL.warc.gz -org,nejm,www)/doi/pdf/10.1056/NEJMoa1013607 20170706005950 http://mit.edu/file.pdf application/pdf 200 MQHD36X5MNZPWFNMD5LFOYZSFGCHUN3V - - 123 456 CRAWL/CRAWL.warc.gz +org,ametsoc,journals)/doi/pdf/10.1175/2008BAMS2370.1 20170706005950 http://mit.edu/file.pdf application/pdf 200 4QHD36X5MNZPWFNMD5LFOYZSFGCHUN3V - - 123 456 CRAWL/CRAWL.warc.gz +org,nejm,www)/doi/pdf/10.1056/NEJMoa1013607 20170706005950 http://mit.edu/file.pdf application/pdf 200 3QHD36X5MNZPWFNMD5LFOYZSFGCHUN3V - - 123 456 CRAWL/CRAWL.warc.gz |