Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
| * | refs: generalize web endpoints; JSON content negotiation; openlibrary inbound view; etc | Bryan Newbold | 2021-07-23 | 4 | -41/+166 | |
| | | ||||||
| * | refs: change mind about URL structure again | Bryan Newbold | 2021-07-23 | 2 | -7/+7 | |
| | | ||||||
| * | web: refactor refs table into separate refs_macros file | Bryan Newbold | 2021-07-23 | 3 | -74/+127 | |
| | | ||||||
| * | refs: small refactors/tweaks | Bryan Newbold | 2021-07-23 | 1 | -11/+17 | |
| | | ||||||
| * | remove unused imports (lint) | Bryan Newbold | 2021-07-23 | 3 | -8/+4 | |
| | | ||||||
| * | web: always log upstream errors (may be redundant) | Bryan Newbold | 2021-07-23 | 1 | -0/+2 | |
| | | ||||||
| * | pylint: skip pydantic import check (dynamic/extensions) | Bryan Newbold | 2021-07-23 | 2 | -8/+4 | |
| | | ||||||
| * | refs: refactor web paths; enrich refs as generic; remove old refs link | Bryan Newbold | 2021-07-23 | 5 | -129/+91 | |
| | | ||||||
| * | refs fetch: add some hacks; sort hits | Bryan Newbold | 2021-07-23 | 1 | -6/+16 | |
| | | ||||||
| * | release view: improve biblio metadata display in central column | Bryan Newbold | 2021-07-23 | 1 | -13/+14 | |
| | | ||||||
| * | match UI: improve form layout | Bryan Newbold | 2021-07-23 | 1 | -13/+16 | |
| | | ||||||
| * | improvements to fuzzy refs view | Bryan Newbold | 2021-07-23 | 3 | -47/+75 | |
| | | | | | | | | | | | | | | | | fixes to release summary macro; show tab counts correctly by re-using generic entity get helper; table styling and 'prev' link; openlibrary access links; parse-and-match button for unmatched+unstructured refs | |||||
| * | fixes for newer ref index | Bryan Newbold | 2021-07-23 | 2 | -50/+11 | |
| | | ||||||
| * | web: inbound/outbound refs as links (temporarily); change URL names | Bryan Newbold | 2021-07-23 | 3 | -3/+7 | |
| | | ||||||
| * | web: initial implementation of fuzzy citation parsing and matching tool | Bryan Newbold | 2021-07-23 | 3 | -0/+173 | |
| | | ||||||
| * | references: refactor to point to access_options transform; comment out CSL fields | Bryan Newbold | 2021-07-23 | 1 | -57/+8 | |
| | | ||||||
| * | partial access options transform for releases | Bryan Newbold | 2021-07-23 | 1 | -0/+58 | |
| | | ||||||
| * | web: template macro to display release entry summary | Bryan Newbold | 2021-07-23 | 1 | -0/+52 | |
| | | ||||||
| * | first iteration of basic citation inbound/outbound views | Bryan Newbold | 2021-07-23 | 3 | -1/+146 | |
| | | ||||||
| * | initial inbound/outbound reference query helpers | Bryan Newbold | 2021-07-23 | 1 | -0/+450 | |
| | | ||||||
* | | guide: updates to roadmap | Bryan Newbold | 2021-07-27 | 1 | -40/+2 | |
|/ | ||||||
* | Merge branch 'martin-pubmed-fetch-gzip-error' into 'master' | bnewbold | 2021-07-17 | 1 | -4/+11 | |
|\ | | | | | | | | | pubmed: do not fail when accessing missing file. See merge request webgroup/fatcat!111 | |||||
| * | pubmed: update docs | Martin Czygan | 2021-07-17 | 1 | -2/+3 | |
| | | ||||||
| * | pubmed: do not fail when accessing missing file | Martin Czygan | 2021-07-17 | 1 | -2/+8 | |
|/ | | | | | | After a sync gap (e.g. 06/07 2021), the harvester wanted to fetch a file that was no longer on the server; do not fail in this case. We'll need to backfill missing records via a full data dump. | |||||
* | Merge branch 'martin-pubmed-eof-sentry-91102' into 'master' | Martin Czygan | 2021-07-16 | 1 | -4/+30 | |
|\ | | | | | | | | | pubmed: reconnect on error. See merge request webgroup/fatcat!110 | |||||
| * | pubmed: reconnect on error | Martin Czygan | 2021-07-16 | 1 | -4/+30 | |
|/ | | | | | | | | | ftp retrieval would run but fail with EOFError on /pubmed/updatefiles/pubmed21n1328_stats.html - not able to find the root cause; using a fresh client, the exact same file would work just fine. So when we retry, we reconnect on failure. Refs: sentry #91102. | |||||
* | CHANGELOG updates (unreleased) | Bryan Newbold | 2021-07-13 | 1 | -0/+7 | |
| | ||||||
* | web: fix flask/werkzeug encoding for mediawiki oauth | Bryan Newbold | 2021-07-13 | 1 | -1/+4 | |
| | ||||||
* | web: fix missing ext_ids default for deleted entity view | Bryan Newbold | 2021-07-13 | 1 | -1/+1 | |
| | ||||||
* | web: fix 'file' entity edit form links | Bryan Newbold | 2021-07-02 | 1 | -1/+1 | |
| | ||||||
* | web: missing trailing parens | Bryan Newbold | 2021-07-02 | 1 | -1/+1 | |
| | ||||||
* | web: PMCID external link improvement | Bryan Newbold | 2021-07-02 | 2 | -2/+2 | |
| | ||||||
* | Merge branch 'bnewbold-more-doi-lower' into 'master' | Martin Czygan | 2021-07-02 | 3 | -3/+8 | |
|\ | | | | | | | | | more consistent and defensive lower-casing of DOIs. See merge request webgroup/fatcat!109 | |||||
| * | more consistent and defensive lower-casing of DOIs | Bryan Newbold | 2021-06-23 | 3 | -3/+8 | |
| | | | | | | | | | | | | | | Done after noticing more upper/lower-case ambiguity in production. In particular, some old ingest requests in the sandcrawler DB, which get re-submitted/re-tried, have capitalized DOIs in the link source id field. | |||||
* | | tests: small citeproc style changes (to match Pipfile.lock update) | Bryan Newbold | 2021-06-23 | 2 | -3/+4 | |
| | | ||||||
* | | pipenv: regenerate lock file | Bryan Newbold | 2021-06-23 | 1 | -26/+68 | |
| | | ||||||
* | | pipenv: add pydantic; add surt; narrow dynaconf | Bryan Newbold | 2021-06-23 | 1 | -1/+3 | |
| | | ||||||
* | | old dblp hacking notes | Bryan Newbold | 2021-06-23 | 1 | -0/+72 | |
|/ | ||||||
* | stats snapshot (2021-06-23) | Bryan Newbold | 2021-06-23 | 2 | -0/+47 | |
| | ||||||
* | SQL dumps: more pigz (vs. gzip) for speed | Bryan Newbold | 2021-06-17 | 1 | -2/+2 | |
| | ||||||
* | fatcat_ref ES schema: more doc_values; source_year not source_release_year | Bryan Newbold | 2021-06-17 | 1 | -5/+2 | |
| | ||||||
* | Merge branch 'martin-datacite-none-title-sentry-88350' into 'master' | Martin Czygan | 2021-06-11 | 4 | -2/+97 | |
|\ | | | | | | | | | datacite: more careful title string access; fixes sentry #88350. See merge request webgroup/fatcat!108 | |||||
| * | datacite: more careful title string access; fixes sentry #88350 | Martin Czygan | 2021-06-11 | 4 | -2/+97 | |
|/ | | | | Caused by a partial "title entry without title" coming *first* (e.g. one holding just a language, like: {'lang': 'da'}) | |||||
* | Merge branch 'bnewbold-clean-doi-lower' into 'master' | Martin Czygan | 2021-06-10 | 1 | -1/+4 | |
|\ | | | | | | | | | clean_doi() should lower-case returned DOI. See merge request webgroup/fatcat!107 | |||||
| * | clean_doi() should lower-case returned DOI | Bryan Newbold | 2021-06-07 | 1 | -1/+4 | |
|/ | | | | | | | | | Code in a number of places (including Pubmed importer) assumed that this was already lower-casing DOIs, resulting in some broken metadata getting created. This is just the first step of mitigation. See also: https://github.com/internetarchive/fatcat/issues/83 | |||||
* | web: fix DOAJ article links (remove trailing slash) | Bryan Newbold | 2021-06-04 | 1 | -1/+1 | |
| | ||||||
* | dblp tests: skip redundant seek(0) | Bryan Newbold | 2021-06-03 | 1 | -6/+1 | |
| | ||||||
* | ingest: swap ingest and file checks, to result in clearer stats/counts of skipping | Bryan Newbold | 2021-06-03 | 1 | -2/+2 | |
| | ||||||
* | ingest: don't accept mag and s2 URLs | Bryan Newbold | 2021-06-03 | 1 | -4/+4 | |
| | ||||||
* | update dblp pre-import notes and pipenv python version (3.8) | Bryan Newbold | 2021-06-03 | 2 | -6/+11 | |
| |
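
The "pubmed: reconnect on error" commit above describes retrying an FTP fetch with a fresh client, because re-using the failed connection kept hitting `EOFError` while a new client could fetch the same file. The following is a minimal sketch of that idea, not the actual fatcat harvester code: the function name `retrieve_with_reconnect`, the `connect` factory parameter, and the `max_attempts` default are all hypothetical and chosen only for illustration.

```python
import ftplib
from typing import Callable, List, Optional


def retrieve_with_reconnect(
    connect: Callable[[], ftplib.FTP],
    path: str,
    max_attempts: int = 3,
) -> bytes:
    """Fetch `path` over FTP, opening a fresh connection for every attempt.

    `connect` is a hypothetical factory that returns a connected,
    logged-in client; it is called again after each failure so that a
    broken connection is never re-used.
    """
    last_error: Optional[BaseException] = None
    for _ in range(max_attempts):
        client = connect()  # fresh client per attempt
        chunks: List[bytes] = []  # discard partial data from failed attempts
        try:
            client.retrbinary(f"RETR {path}", chunks.append)
            return b"".join(chunks)
        except (EOFError, ftplib.Error, OSError) as err:
            last_error = err  # remember the failure, then retry
        finally:
            try:
                client.close()
            except Exception:
                pass  # closing a broken connection may itself fail
    raise RuntimeError(
        f"giving up on {path} after {max_attempts} attempts"
    ) from last_error
```

Passing a factory rather than a client instance is the design choice that makes "reconnect on failure" explicit: the caller decides how to connect and log in, and the helper only decides when a new connection is needed.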