| Commit message (Collapse) | Author | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | add ojs and dspace as in-domain patterns to look for in heuristic CDX PDF filter | Bryan Newbold | 2019-04-12 | 1 | -1/+1 |
| | | |||||
| * | distinct on SHA1 in cdx scripts | Bryan Newbold | 2018-05-08 | 1 | -4/+10 |
| | | |||||
| * | pig script to filter GWB CDX by SURT regexes | Bryan Newbold | 2018-05-07 | 1 | -0/+41 |
