Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | point 'please' to python_hadoop | Bryan Newbold | 2019-09-25 | 1 | -4/+4 |
| | |||||
* | GroupFatcatWorksSubsetJob | Bryan Newbold | 2019-08-26 | 1 | -0/+44 |
| | | | | | | | | | | | | This is a hack-y variant of GroupFatcatWorksSubsetJob which allows setting different left and right sides of the join. The initial application is to re-run work merging with only longtail-oa works on the "left", with the goal of hard-merging these releases into existing releases with actual identifiers (instead of just grouping into works). As a refactor, the normal GroupFatcatWorksJob could just be this with the same file passed as both left and right, though that requires twice as much JSON parsing/filtering. | ||||
* | please command for groupworksfatcat | Bryan Newbold | 2019-08-10 | 1 | -0/+63 |
| | |||||
* | please: add staging config (commented out) | Bryan Newbold | 2019-07-07 | 1 | -0/+4 |
| | |||||
* | scalding dump-grobid-status-code job | Bryan Newbold | 2019-04-12 | 1 | -0/+24 |
| | |||||
* | set long timeout on HBaseStatusCountJob | Bryan Newbold | 2019-02-26 | 1 | -1/+3 |
| | |||||
* | longer match-crossref timeout | Bryan Newbold | 2018-12-18 | 1 | -2/+3 |
| | |||||
* | please support DumpGrobidXmlJob | Bryan Newbold | 2018-10-30 | 1 | -0/+24 |
| | |||||
* | please support for DumpGrobidMetaInsertableJob | Bryan Newbold | 2018-09-22 | 1 | -0/+24 |
| | |||||
* | dumpfilemeta support in please | Bryan Newbold | 2018-09-13 | 1 | -0/+24 |
| | |||||
* | insertable flag for match-crossref | Bryan Newbold | 2018-09-12 | 1 | -1/+9 |
| | |||||
* | match crossref reducers=200 | Bryan Newbold | 2018-08-31 | 1 | -1/+1 |
| | |||||
* | please: save extraction output | Bryan Newbold | 2018-08-26 | 1 | -0/+6 |
| | |||||
* | add extraction_ungrobided support to please | Bryan Newbold | 2018-08-25 | 1 | -0/+30 |
| | |||||
* | please support for DumpUnGrobidedJob | Bryan Newbold | 2018-08-24 | 1 | -0/+24 |
| | |||||
* | Merge branch 'bnewbold-missing-column' | Bryan Newbold | 2018-08-24 | 1 | -0/+29 |
|\ | | | | | | | | | | | Manually Resolved Conflicts: please | ||||
| * | fixes to please keys-missing-col | Bryan Newbold | 2018-08-21 | 1 | -2/+2 |
| | | |||||
| * | add please for keysmissingcolumn | Bryan Newbold | 2018-08-21 | 1 | -0/+29 |
| | | |||||
* | | clarify please docs | Bryan Newbold | 2018-08-24 | 1 | -2/+2 |
| | | |||||
* | | rename ./mapreduce to ./python | Bryan Newbold | 2018-08-24 | 1 | -3/+3 |
| | | |||||
* | | fix merge typos in please | Bryan Newbold | 2018-08-24 | 1 | -2/+2 |
| | | |||||
* | | Merge branch 'bnewbold-match-quality' | Bryan Newbold | 2018-08-24 | 1 | -0/+28 |
|\ \ | |/ |/| | | | | | | | Manually resolved merge conflict in: please | ||||
| * | please support for match-benchmark | Bryan Newbold | 2018-08-21 | 1 | -0/+26 |
| | | |||||
| * | fix bug with qa/prod detection | Bryan Newbold | 2018-08-21 | 1 | -0/+1 |
| | | |||||
* | | Merge branch 'bnewbold-match-scale' | Bryan Newbold | 2018-08-21 | 1 | -0/+5 |
|\ \ | |||||
| * | | explicit spill and compression settings for ScoreJob | Bryan Newbold | 2018-08-20 | 1 | -0/+5 |
| |/ | |||||
* | | HDFS doesn't like colons | Bryan Newbold | 2018-08-21 | 1 | -1/+1 |
| | | |||||
* | | please support for status-code-count | Bryan Newbold | 2018-08-21 | 1 | -0/+24 |
| | | |||||
* | | make col counter generic | Bryan Newbold | 2018-08-21 | 1 | -0/+28 |
| | | |||||
* | | please support for grobid-scorable-dump | Bryan Newbold | 2018-08-21 | 1 | -0/+24 |
|/ | |||||
* | update 'please' command for scoring refactor | Bryan Newbold | 2018-08-15 | 1 | -1/+10 |
| | |||||
* | add 'please' command for crossref matching | Bryan Newbold | 2018-07-27 | 1 | -0/+28 |
| | |||||
* | update please helpers to provide hbase+zk config | Bryan Newbold | 2018-07-15 | 1 | -2/+13 |
| | |||||
* | please: status-count | Bryan Newbold | 2018-06-15 | 1 | -0/+21 |
| | |||||
* | please: extract | Bryan Newbold | 2018-06-15 | 1 | -0/+31 |
| | | | | This script needs refactoring! | ||||
* | please: split out rebuild steps | Bryan Newbold | 2018-06-15 | 1 | -3/+18 |
| | |||||
* | doc improvements and fixes to 'please' helper | Bryan Newbold | 2018-06-15 | 1 | -24/+23 |
| | |||||
* | helper script for running jobs | Bryan Newbold | 2018-06-14 | 1 | -0/+86 |