Commit message (Expand) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | refactor old python hadoop code into new directory | Bryan Newbold | 2019-09-25 | 4 | -591/+0 |
* | re-write parse_cdx_line for sandcrawler lib | Bryan Newbold | 2019-09-25 | 1 | -1/+31 |
* | fix test grobid2json test | Bryan Newbold | 2019-09-25 | 1 | -1/+4 |
* | start refactoring sandcrawler python common code | Bryan Newbold | 2019-09-23 | 2 | -0/+41 |
* | update grobid2json to include given_name/surname | Bryan Newbold | 2019-05-13 | 1 | -3/+3 |
* | python test fixes | Bryan Newbold | 2019-02-21 | 1 | -1/+1 |
* | fix ungrobid extraction tests | Bryan Newbold | 2018-11-22 | 1 | -2/+4 |
* | longtail grobid metadata parse/filter WIP | Bryan Newbold | 2018-09-22 | 1 | -0/+5 |
* | WIP: ungrobided doesn't inherit (copypasta) | Bryan Newbold | 2018-08-25 | 1 | -4/+4 |
* | ungrobided: example real output | Bryan Newbold | 2018-08-25 | 1 | -0/+20 |
* | ungrobided: add real results to tests | Bryan Newbold | 2018-08-25 | 1 | -1/+51 |
* | python extraction_ungrobided job | Bryan Newbold | 2018-08-24 | 1 | -0/+126 |
* | rename ./mapreduce to ./python | Bryan Newbold | 2018-08-24 | 8 | -0/+2632 |