Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | dblp import/update notes | Bryan Newbold | 2022-07-19 | 1 | -0/+114 |
| | |||||
* | dblp: updated ingest pipeline | Bryan Newbold | 2022-07-19 | 6 | -7/+213 |
| | |||||
* | cleanup: DOAJ missing container_id | Bryan Newbold | 2022-07-12 | 2 | -0/+41 |
| | |||||
* | document bulk chocula update | Bryan Newbold | 2022-07-06 | 2 | -0/+30 |
| | |||||
* | update stats | Bryan Newbold | 2022-07-06 | 3 | -0/+50 |
| | |||||
* | bulk dumps: update Makefile with bugfixes | Bryan Newbold | 2022-04-26 | 1 | -28/+28 |
| | |||||
* | fix some more isiarticles (with :80 in URL) | Bryan Newbold | 2022-04-20 | 2 | -0/+18 |
| | |||||
* | bulk edits: docs on initial dataset/fileset ingest | Bryan Newbold | 2022-04-20 | 1 | -0/+22 |
| | |||||
* | cleanups: isiarticles | Bryan Newbold | 2022-04-20 | 3 | -0/+49 |
| | |||||
* | stats: just as unpaywall bulk ingest starting | Bryan Newbold | 2022-04-19 | 1 | -0/+1 |
| | |||||
* | dump/export helper Makefile | Bryan Newbold | 2022-04-18 | 1 | -0/+93 |
| | |||||
* | container status: add simple prod single-command script | Bryan Newbold | 2022-04-08 | 1 | -0/+20 |
| | |||||
* | 2022-03-21 fatcat stats | Bryan Newbold | 2022-03-22 | 2 | -0/+48 |
| | |||||
* | document recent bulk metadata edits/imports | Bryan Newbold | 2022-03-22 | 3 | -0/+62 |
| | |||||
* | Merge branch 'bnewbold-container-web' into 'master' | bnewbold | 2022-03-10 | 1 | -0/+6 |
|\ | | | | | | | | | container web interface improvements See merge request webgroup/fatcat!140 | ||||
| * | container ES schema: more aliases | Bryan Newbold | 2022-02-09 | 1 | -0/+6 |
| | | |||||
* | | sql dumps: use 'custom' mode instead of 'tar' | Bryan Newbold | 2022-02-23 | 1 | -1/+5 |
|/ | |||||
* | bulk cleanups: NCI chem entries; IRs with container_id; PLOS non-articles | Bryan Newbold | 2022-02-09 | 4 | -0/+330 |
| | |||||
* | bulk metadata edit log | Bryan Newbold | 2022-02-04 | 3 | -0/+223 |
| | |||||
* | commit updated stats | Bryan Newbold | 2022-01-26 | 2 | -0/+47 |
| | |||||
* | docker focal: update base image for focal/py38 | Bryan Newbold | 2022-01-26 | 1 | -36/+11 |
| | |||||
* | container counts update process README | Bryan Newbold | 2022-01-21 | 1 | -0/+41 |
| | |||||
* | update stats | Bryan Newbold | 2022-01-12 | 3 | -0/+49 |
| | |||||
* | ES: update README for v05-era indices | Bryan Newbold | 2022-01-12 | 1 | -15/+15 |
| | |||||
* | ES schema: fix typo in container issns alias | Bryan Newbold | 2022-01-12 | 1 | -1/+1 |
| | |||||
* | another file_meta update | Bryan Newbold | 2021-12-06 | 1 | -0/+60 |
| | |||||
* | ES container schema: add 'sim_pubid' and `ia_sim_collection` fields | Bryan Newbold | 2021-12-03 | 1 | -0/+2 |
| | |||||
* | SQL snashots/exports: updated prod commands | Bryan Newbold | 2021-12-03 | 1 | -13/+15 |
| | |||||
* | file_meta cleanup update | Bryan Newbold | 2021-12-01 | 1 | -0/+75 |
| | |||||
* | initial 'far-future' release date updates | Bryan Newbold | 2021-11-30 | 1 | -0/+212 |
| | |||||
* | chocula update notes | Bryan Newbold | 2021-11-30 | 1 | -0/+61 |
| | |||||
* | container ISSN-L dedupe notes | Bryan Newbold | 2021-11-30 | 1 | -0/+198 |
| | |||||
* | add stats (before re-indexing), and rename old files for consistency | Bryan Newbold | 2021-11-30 | 6 | -0/+47 |
| | |||||
* | cleanups: springer 'page-one' sample PDFs | Bryan Newbold | 2021-11-29 | 2 | -0/+129 |
| | |||||
* | cleanups: truncated wayback PDFs from common crawl | Bryan Newbold | 2021-11-29 | 2 | -0/+292 |
| | |||||
* | update to truncated wayback timestamp issue | Bryan Newbold | 2021-11-29 | 1 | -0/+24 |
| | |||||
* | update to file short wayback timestamp cleanup | Bryan Newbold | 2021-11-29 | 2 | -1/+30 |
| | |||||
* | commit old 2021-11-11 stats file | Bryan Newbold | 2021-11-29 | 1 | -0/+1 |
| | |||||
* | clean up extra/ folder a bit | Bryan Newbold | 2021-11-29 | 11 | -24/+0 |
| | |||||
* | move notes/bulk_edits/ to extra/bulk_edits/ | Bryan Newbold | 2021-11-29 | 23 | -0/+1743 |
| | |||||
* | move 'cleanups' directory from notes to extra/ | Bryan Newbold | 2021-11-29 | 11 | -0/+1306 |
| | |||||
* | codespell fixes to various other docs | Bryan Newbold | 2021-11-24 | 3 | -4/+4 |
| | |||||
* | content_scope: include in file ES schema and transform | Bryan Newbold | 2021-11-17 | 1 | -0/+1 |
| | |||||
* | ISSN-L dupes check: output all matches | Bryan Newbold | 2021-11-17 | 1 | -1/+1 |
| | |||||
* | sitemap generation improvements | Bryan Newbold | 2021-11-10 | 2 | -1/+2 |
| | |||||
* | elasticsearch schema changes | Bryan Newbold | 2021-10-13 | 2 | -3/+13 |
| | |||||
* | update stats | Bryan Newbold | 2021-10-11 | 3 | -0/+48 |
| | |||||
* | sql_dumps: set collection at upload time | Bryan Newbold | 2021-09-02 | 1 | -2/+5 |
| | |||||
* | prod stats snapshot | Bryan Newbold | 2021-08-06 | 4 | -0/+47 |
| | |||||
* | stats snapshot (2021-06-23) | Bryan Newbold | 2021-06-23 | 2 | -0/+47 |
| |