Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | rust: expand valid hdl identifiers | Bryan Newbold | 2021-10-13 | 1 | -3/+7 | |
| | | | | Based on real examples found on dblp.org | |||||
* | dblp import: basic support for handles as identifiers | Bryan Newbold | 2021-10-13 | 1 | -1/+5 | |
| | ||||||
* | python: normalization/validation support for handle identifiers (hdl) | Bryan Newbold | 2021-10-13 | 1 | -0/+33 | |
| | ||||||
* | dblp import: fix typos in identifier parsing | Bryan Newbold | 2021-10-13 | 1 | -2/+1 | |
| | ||||||
* | guide updates for v0.4 schema changes | Bryan Newbold | 2021-10-13 | 3 | -12/+57 | |
| | ||||||
* | CHANGELOG updates for v0.4 release | Bryan Newbold | 2021-10-13 | 1 | -1/+18 | |
| | ||||||
* | python: partial importer utilization of new schema changes | Bryan Newbold | 2021-10-13 | 3 | -6/+18 | |
| | ||||||
* | python: test coverage of rust schema changes | Bryan Newbold | 2021-10-13 | 4 | -2/+59 | |
| | ||||||
* | python: implement ES schema changes | Bryan Newbold | 2021-10-13 | 1 | -4/+17 | |
| | ||||||
* | web: implement new schema changes | Bryan Newbold | 2021-10-13 | 6 | -11/+45 | |
| | ||||||
* | elasticsearch schema changes | Bryan Newbold | 2021-10-13 | 2 | -3/+13 | |
| | ||||||
* | rust: partial test coverage of schema changes | Bryan Newbold | 2021-10-13 | 1 | -2/+32 | |
| | ||||||
* | rust: prep for possible DOI lowercase enforcement | Bryan Newbold | 2021-10-13 | 1 | -1/+5 | |
| | | | | | | See also: https://github.com/internetarchive/fatcat/issues/83 This commit is no behavior change, just leaving a note to self. | |||||
* | rust: implement scheman and API changes | Bryan Newbold | 2021-10-13 | 5 | -38/+353 | |
| | ||||||
* | rust: handle new migrations in test helper | Bryan Newbold | 2021-10-13 | 1 | -1/+1 | |
| | ||||||
* | rust: implement recent SQL changes | Bryan Newbold | 2021-10-13 | 2 | -0/+12 | |
| | ||||||
* | fatcatd: display version correctly, and at startup | Bryan Newbold | 2021-10-13 | 1 | -2/+8 | |
| | ||||||
* | python client: codegen for v0.4 | Bryan Newbold | 2021-10-13 | 8 | -25/+325 | |
| | ||||||
* | python client: bump version in codegen script | Bryan Newbold | 2021-10-13 | 1 | -1/+1 | |
| | ||||||
* | fatcat-api: enforce more release ext_id checks at create/update | Bryan Newbold | 2021-10-13 | 1 | -2/+15 | |
| | | | | Not enforcing these was a serious bug! | |||||
* | sql: v0.4 schema implementation (as diesel migration) | Bryan Newbold | 2021-10-13 | 2 | -0/+58 | |
| | ||||||
* | bump rust code version to v0.4.0 | Bryan Newbold | 2021-10-13 | 3 | -5/+7 | |
| | ||||||
* | rust codegen for v0.4 | Bryan Newbold | 2021-10-13 | 8 | -25/+307 | |
| | ||||||
* | schema: implement v0.4 tweaks, and bump version number | Bryan Newbold | 2021-10-13 | 1 | -2/+71 | |
| | ||||||
* | update proposals for v0.4 and (hypothetical) v0.5 | Bryan Newbold | 2021-10-13 | 2 | -4/+35 | |
| | ||||||
* | update stats | Bryan Newbold | 2021-10-11 | 3 | -0/+48 | |
| | ||||||
* | another vanished content example | Bryan Newbold | 2021-10-07 | 1 | -0/+7 | |
| | ||||||
* | Merge branch 'bnewbold-ingest-tweaks' into 'master' | bnewbold | 2021-10-02 | 5 | -39/+142 | |
|\ | | | | | | | | | ingest importer behavior tweaks See merge request webgroup/fatcat!120 | |||||
| * | update changelog with notable ingest importer tweaks | Bryan Newbold | 2021-10-01 | 1 | -0/+3 | |
| | | ||||||
| * | kafka import: optional 'force-flush' mode for some importers | Bryan Newbold | 2021-10-01 | 2 | -0/+16 | |
| | | | | | | | | Behavior and motivation described in the kafka json import comment. | |||||
| * | new SPN web (html) importer | Bryan Newbold | 2021-10-01 | 3 | -27/+111 | |
| | | ||||||
| * | ingest importer behavior tweaks | Bryan Newbold | 2021-10-01 | 1 | -8/+8 | |
| | | | | | | | | | | - change order of 'want()' checks, so that result counts are clearer - don't require GROBID success for file imports with SPN | |||||
| * | importer common: more verbose logging (with counts) | Bryan Newbold | 2021-10-01 | 1 | -4/+4 | |
| | | ||||||
* | | Merge branch 'martin-datacite-emtpy-abstract-sentry-94639' into 'master' | bnewbold | 2021-10-02 | 4 | -2/+95 | |
|\ \ | |/ |/| | | | | | datacite: skip empty abstracts See merge request webgroup/fatcat!119 | |||||
| * | datacite: skip empty abstracts | Martin Czygan | 2021-10-01 | 4 | -2/+95 | |
|/ | | | | | Do not add abstracts where `clean` results in the empty string - this violates a constraint: `either abstract_sha1 or content is required` | |||||
* | default ingest request topic now '-daily'; configurable for ingest_tool.py | Bryan Newbold | 2021-09-30 | 4 | -4/+9 | |
| | ||||||
* | Merge branch 'martin-pubmed-ftp-extramuros' into 'master' | Martin Czygan | 2021-09-09 | 1 | -24/+21 | |
|\ | | | | | | | | | pubmed: workaround a networking issue See merge request webgroup/fatcat!118 | |||||
| * | pubmed: workaround a networking issue | Martin Czygan | 2021-09-09 | 1 | -24/+21 | |
| | | | | | | | | | | | | use an http proxy (https://github.com/miku/ftpup) to fetch files from FTP, keep some retry logic; also, hardcoding the proxy path as this should be a temporary workaround | |||||
* | | trivial blank line lint | Bryan Newbold | 2021-09-08 | 1 | -1/+0 | |
| | | ||||||
* | | Merge branch 'master' of git.archive.org:webgroup/fatcat | Bryan Newbold | 2021-09-08 | 1 | -2/+31 | |
|\| | ||||||
| * | Merge branch 'martin-pubmed-use-lftp' into 'master' | Martin Czygan | 2021-09-08 | 1 | -2/+31 | |
| |\ | | | | | | | | | | | | | pubmed: add option to ftp download with lftp See merge request webgroup/fatcat!117 | |||||
| | * | pubmed: add option to ftp download with lftp | Martin Czygan | 2021-09-08 | 1 | -2/+31 | |
| |/ | | | | | | | | | lftp is a classic command line ftp client, and we hope that its retry capabilities are enough of a workaround for the current networking issue | |||||
* / | sql_dumps: set collection at upload time | Bryan Newbold | 2021-09-02 | 1 | -2/+5 | |
|/ | ||||||
* | Merge branch 'martin-pubmed-eof-sentry-92151' into 'master' | Martin Czygan | 2021-08-21 | 1 | -8/+21 | |
|\ | | | | | | | | | pubmed harvester: add basic retry logic See merge request webgroup/fatcat!116 | |||||
| * | pubmed harvester: add basic retry logic | Martin Czygan | 2021-08-20 | 1 | -8/+21 | |
|/ | | | | | | | | Related to a previous issue with seemingly random EOFError from FTP connections, this patch wrap "ftpretr" helper function with a basic retry. Refs: fatcat-workers/issues/92151, fatcat-workers/issues/91102 | |||||
* | guide: remove accidental duplicated background section | Bryan Newbold | 2021-08-18 | 1 | -9/+0 | |
| | ||||||
* | cgraph -> refcat | Bryan Newbold | 2021-08-13 | 2 | -2/+2 | |
| | ||||||
* | web: fix stats rowspan (oops) | Bryan Newbold | 2021-08-12 | 1 | -1/+1 | |
| | ||||||
* | web: remove confusing 'references' row from stats table | Bryan Newbold | 2021-08-12 | 1 | -3/+0 | |
| | | | | Now that we have refcat, which is a different number | |||||
* | Merge branch 'martin-guide-ref-minor-tweaks' into 'master' | bnewbold | 2021-08-09 | 1 | -3/+4 | |
|\ | | | | | | | | | guide: reference graph, minor tweaks See merge request webgroup/fatcat!115 |