| Commit message (Collapse) | Author | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | notes on large-domain ingest tweaks | Bryan Newbold | 2021-05-27 | 1 | -0/+480 | 
| | | |||||
| * | 2021-04 unpaywall crawl notes | Bryan Newbold | 2021-05-27 | 1 | -0/+368 | 
| | | |||||
| * | late-2020 OA DOI crawl ingest notes | Bryan Newbold | 2021-01-04 | 1 | -3/+46 | 
| | | |||||
| * | DOAJ crawl ingest stats | Bryan Newbold | 2020-12-31 | 1 | -0/+295 | 
| | | |||||
| * | progress notes on OA DOI ingest (still running) | Bryan Newbold | 2020-12-28 | 1 | -11/+102 | 
| | | |||||
| * | HTML ingest deployment notes | Bryan Newbold | 2020-12-16 | 1 | -1/+71 | 
| | | |||||
| * | unpaywall crawl/ingest update (from Oct 2020) | Bryan Newbold | 2020-12-08 | 1 | -0/+134 | 
| | | |||||
| * | commit sept 2020 scielo ingest notes | Bryan Newbold | 2020-12-08 | 1 | -0/+21 | 
| | | |||||
| * | add implementation notes about HTML ingest | Bryan Newbold | 2020-11-10 | 1 | -0/+248 | 
| | | |||||
| * | fuzzy matching notes | Bryan Newbold | 2020-11-10 | 1 | -0/+148 | 
| | | |||||
| * | unpaywall oct 2020 crawl notes | Bryan Newbold | 2020-11-02 | 1 | -45/+82 | 
| | | |||||
| * | more notes on unpaywall ingest from last week | Bryan Newbold | 2020-10-27 | 1 | -0/+73 | 
| | | |||||
| * | notes on 2020-09 re-ingest passes | Bryan Newbold | 2020-10-17 | 1 | -0/+197 | 
| | | |||||
| * | OA DOIs: partial notes | Bryan Newbold | 2020-10-17 | 1 | -0/+218 | 
| | | |||||
| * | notes/status on daily ingest | Bryan Newbold | 2020-10-17 | 1 | -0/+193 | 
| | | |||||
| * | start 2020-10 ingest notes | Bryan Newbold | 2020-10-11 | 1 | -0/+42 | 
| | | |||||
| * | update unpaywall 2020-04 notes | Bryan Newbold | 2020-10-11 | 1 | -0/+32 | 
| | | |||||
| * | OAI-PMH ingest progress timestamps | Bryan Newbold | 2020-10-11 | 1 | -0/+13 | 
| | | |||||
| * | notes on file_meta task (from august) | Bryan Newbold | 2020-10-01 | 1 | -0/+66 | 
| | | |||||
| * | OAI-PMH ingest notes | Bryan Newbold | 2020-09-03 | 1 | -0/+232 | 
| | | |||||
| * | daily ingest notes | Bryan Newbold | 2020-09-02 | 1 | -0/+202 | 
| | | |||||
| * | follow-up notes on processing 'holes' | Bryan Newbold | 2020-09-02 | 1 | -0/+19 | 
| | | |||||
| * | unpaywall ingest follow-up | Bryan Newbold | 2020-09-02 | 1 | -0/+115 | 
| | | |||||
| * | grobid+pdftext missing catch-up commands | Bryan Newbold | 2020-08-05 | 1 | -0/+101 | 
| | | |||||
| * | MAG ingest follow-up notes | Bryan Newbold | 2020-08-05 | 1 | -0/+194 | 
| | | |||||
| * | MAG 2020-07 ingest notes | Bryan Newbold | 2020-07-08 | 1 | -0/+159 | 
| | | |||||
| * | 2020-05_pubmed ingest notes (short) | Bryan Newbold | 2020-06-25 | 1 | -0/+10 | 
| | | |||||
| * | commit old notes on a one-off CDX table cleanup | Bryan Newbold | 2020-06-25 | 1 | -0/+34 | 
| | | |||||
| * | commit old (2020-02) pdftrio commands | Bryan Newbold | 2020-06-25 | 1 | -0/+162 | 
| | | |||||
| * | ingest: OAI-PMH count table | Bryan Newbold | 2020-05-28 | 1 | -0/+24 | 
| | | |||||
| * | ingest notes | Bryan Newbold | 2020-05-26 | 2 | -6/+76 | 
| | | |||||
| * | potential future backfill ingests | Bryan Newbold | 2020-05-26 | 1 | -0/+52 | 
| | | |||||
| * | ingests: normalize file names; commit updates | Bryan Newbold | 2020-05-26 | 10 | -63/+279 | 
| | | |||||
| * | summarize datacite and MAG 2020 crawls | Bryan Newbold | 2020-05-05 | 2 | -0/+200 | 
| | | |||||
| * | update MAG crawl notes | Bryan Newbold | 2020-04-28 | 1 | -0/+71 | 
| | | |||||
| * | COVID-19 chinese paper ingest | Bryan Newbold | 2020-04-15 | 1 | -0/+73 | 
| | | |||||
| * | 2020-04 unpaywall ingest (in progress) | Bryan Newbold | 2020-04-15 | 1 | -0/+63 | 
| | | |||||
| * | 2020-04 datacite ingest (in progress) | Bryan Newbold | 2020-04-15 | 1 | -0/+18 | 
| | | |||||
| * | partial notes on S2 crawl ingest | Bryan Newbold | 2020-04-15 | 1 | -0/+35 | 
| | | |||||
| * | MAG import notes | Bryan Newbold | 2020-04-13 | 1 | -0/+13 | 
| | | |||||
| * | MAG 2020-03-04 ingest notes to date | Bryan Newbold | 2020-04-06 | 1 | -0/+395 | 
| | | |||||
| * | unpaywall ingest notes update | Bryan Newbold | 2020-03-30 | 1 | -0/+138 | 
| | | |||||
| * | unpaywall large ingest notes | Bryan Newbold | 2020-03-17 | 1 | -0/+10 | 
| | | |||||
| * | more unpaywall ingest notes | Bryan Newbold | 2020-03-05 | 1 | -0/+416 | 
| | | |||||
| * | update (and move) ingest notes | Bryan Newbold | 2020-03-03 | 6 | -0/+480 | 
| | | |||||
| * | ingest backfill notes | Bryan Newbold | 2020-02-24 | 3 | -0/+150 | 
| | | |||||
| * | jan 2020 bulk ingest notes | Bryan Newbold | 2020-02-12 | 1 | -0/+26 | 
| | | |||||
| * | add notes on recent ingest and backfill tasks | Bryan Newbold | 2020-02-05 | 3 | -0/+221 | 
| | | |||||
| * | hadoop job log rename and update | Bryan Newbold | 2019-12-27 | 1 | -0/+25 | 
| | | |||||
| * | update job log with pig runs | Bryan Newbold | 2019-12-26 | 1 | -0/+10 | 
| | | |||||
