Commit message | Author | Age | Files | Lines |
---|---|---|---|---|
add ojs and dspace as in-domain patterns to look for in heuristic CDX PDF filter | Bryan Newbold | 2019-04-12 | 1 | -1/+1 |
distinct on SHA1 in cdx scripts | Bryan Newbold | 2018-05-08 | 1 | -4/+10 |
pig script to filter GWB CDX by SURT regexes | Bryan Newbold | 2018-05-07 | 1 | -0/+41 |