Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | improvements to fuzzy refs view | Bryan Newbold | 2021-07-23 | 3 | -47/+75 |
| | | | | | | | | fixes to release summary macro; show tab counts correctly by re-using generic entity get helper; table styling and 'prev' link; openlibrary access links; parse-and-match button for unmatched+unstructured refs | ||||
* | fixes for newer ref index | Bryan Newbold | 2021-07-23 | 2 | -50/+11 |
| | |||||
* | web: inbound/outbound refs as links (temporarily); change URL names | Bryan Newbold | 2021-07-23 | 3 | -3/+7 |
| | |||||
* | web: initial implementation of fuzzy citation parsing and matching tool | Bryan Newbold | 2021-07-23 | 3 | -0/+173 |
| | |||||
* | references: refactor to point to access_options transform; comment out CSL fields | Bryan Newbold | 2021-07-23 | 1 | -57/+8 |
| | |||||
* | partial access options transform for releases | Bryan Newbold | 2021-07-23 | 1 | -0/+58 |
| | |||||
* | web: template macro to display release entry summary | Bryan Newbold | 2021-07-23 | 1 | -0/+52 |
| | |||||
* | first iteration of basic citation inbound/outbound views | Bryan Newbold | 2021-07-23 | 3 | -1/+146 |
| | |||||
* | initial inbound/outbound reference query helpers | Bryan Newbold | 2021-07-23 | 1 | -0/+450 |
| | |||||
* | pubmed: update docs | Martin Czygan | 2021-07-17 | 1 | -2/+3 |
| | |||||
* | pubmed: do not fail when accessing missing file | Martin Czygan | 2021-07-17 | 1 | -2/+8 |
| | | | | | | | After a sync gap (e.g. 06/07 2021), the harvester wanted to fetch a file that was not on the server (any more); do not fail in this case. We'll need to backfill missing records via a full data dump. | ||||
* | pubmed: reconnect on error | Martin Czygan | 2021-07-16 | 1 | -4/+30 |
| | | | | | | | | | ftp retrieval would run but fail with EOFError on /pubmed/updatefiles/pubmed21n1328_stats.html - not able to find the root cause; using a fresh client, the exact same file would work just fine. So when we retry, we reconnect on failure. Refs: sentry #91102. | ||||
* | web: fix flask/werkzeug encoding for mediawiki oauth | Bryan Newbold | 2021-07-13 | 1 | -1/+4 |
| | |||||
* | web: fix missing ext_ids default for deleted entity view | Bryan Newbold | 2021-07-13 | 1 | -1/+1 |
| | |||||
* | web: fix 'file' entity edit form links | Bryan Newbold | 2021-07-02 | 1 | -1/+1 |
| | |||||
* | web: missing trailing parens | Bryan Newbold | 2021-07-02 | 1 | -1/+1 |
| | |||||
* | web: PMCID external link improvement | Bryan Newbold | 2021-07-02 | 2 | -2/+2 |
| | |||||
* | Merge branch 'bnewbold-more-doi-lower' into 'master' | Martin Czygan | 2021-07-02 | 3 | -3/+8 |
|\ | | | | | | | | | more consistent and defensive lower-casing of DOIs. See merge request webgroup/fatcat!109 | ||||
| * | more consistent and defensive lower-casing of DOIs | Bryan Newbold | 2021-06-23 | 3 | -3/+8 |
| | | | | | | | | | | | | | | After noticing more upper/lower ambiguity in production. In particular, we have some old ingest requests in sandcrawler DB, which get re-submitted/re-tried, which have capitalized DOIs in the link source id field. | ||||
* | | tests: small citeproc style changes (to match Pipfile.lock update) | Bryan Newbold | 2021-06-23 | 2 | -3/+4 |
| | | |||||
* | | pipenv: regenerate lock file | Bryan Newbold | 2021-06-23 | 1 | -26/+68 |
| | | |||||
* | | pipenv: add pydantic; add surt; narrow dynaconf | Bryan Newbold | 2021-06-23 | 1 | -1/+3 |
|/ | |||||
* | datacite: more careful title string access; fixes sentry #88350 | Martin Czygan | 2021-06-11 | 4 | -2/+97 |
| | | | | | Caused by a partial "title entry without title" coming *first* (e.g. one holding just a language, like {'lang': 'da'}). | ||||
* | clean_doi() should lower-case returned DOI | Bryan Newbold | 2021-06-07 | 1 | -1/+4 |
| | | | | | | | | | | Code in a number of places (including Pubmed importer) assumed that this was already lower-casing DOIs, resulting in some broken metadata getting created. See also: https://github.com/internetarchive/fatcat/issues/83 This is just the first step of mitigation. | ||||
* | web: fix DOAJ article links (remove trailing slash) | Bryan Newbold | 2021-06-04 | 1 | -1/+1 |
| | |||||
* | dblp tests: skip redundant seek(0) | Bryan Newbold | 2021-06-03 | 1 | -6/+1 |
| | |||||
* | ingest: swap ingest and file checks, to result in clearer stats/counts of skipping | Bryan Newbold | 2021-06-03 | 1 | -2/+2 |
| | |||||
* | ingest: don't accept mag and s2 URLs | Bryan Newbold | 2021-06-03 | 1 | -4/+4 |
| | |||||
* | bump fuzzycat dependency to 0.1.21 | Bryan Newbold | 2021-06-02 | 2 | -20/+18 |
| | |||||
* | web: fix spacing for doaj/dblp identifiers in SERP | Bryan Newbold | 2021-05-31 | 1 | -1/+1 |
| | |||||
* | ingest: don't 'track_total_hits' for ES 7.x count() | Bryan Newbold | 2021-05-31 | 1 | -1/+1 |
| | |||||
* | web: bugfix dblp vs. doaj display logic | Bryan Newbold | 2021-05-31 | 1 | -1/+1 |
| | |||||
* | update fuzzycat to 0.1.20 | Bryan Newbold | 2021-05-31 | 2 | -31/+94 |
| | |||||
* | makefile: add pylint -E invocation to 'make lint', to match CI | Bryan Newbold | 2021-05-25 | 1 | -0/+1 |
| | |||||
* | skip pylint on 'assigning-non-slot' warnings in Flask 2.0 | Bryan Newbold | 2021-05-25 | 1 | -2/+2 |
| | | | | | | | | | | | | | | The 'permanent' field is still valid to set to a boolean in Flask 2.0; not sure why pylint is unhappy in CI (causing test failures). Don't see any problem running test suite locally. Flask API docs: https://flask.palletsprojects.com/en/2.0.x/api/?highlight=permanent#flask.session.permanent And code (recent master branch): https://github.com/pallets/flask/blob/4240ace59710d86c478111affd4ad6fb4c8cad9e/src/flask/sessions.py#L20 | ||||
* | changelog worker: fix file/fileset typo, caught by lint | Bryan Newbold | 2021-05-25 | 1 | -1/+1 |
| | | | | | This would have been resulting in some releases not getting re-indexed into search. | ||||
* | small python lint fixes (no behavior change) | Bryan Newbold | 2021-05-25 | 5 | -6/+4 |
| | |||||
* | bump Flask to 2.x; other deps upgraded automatically | Bryan Newbold | 2021-05-21 | 2 | -152/+167 |
| | |||||
* | ingest: add per-container ingest type overrides | Bryan Newbold | 2021-05-21 | 2 | -1/+23 |
| | |||||
* | fix arabesque sqlite3 examples to have 14-digit timestamps | Bryan Newbold | 2021-05-21 | 1 | -0/+0 |
| | |||||
* | arabesque importer: ensure full 14-digit timestamps | Bryan Newbold | 2021-05-21 | 1 | -1/+3 |
| | |||||
* | Andrew W. Mellon Foundation | Bryan Newbold | 2021-05-18 | 2 | -3/+3 |
| | |||||
* | Merge branch 'bnewbold-pipenv-cleanup' into 'master' | bnewbold | 2021-04-23 | 2 | -327/+277 |
|\ | | | | | | | | | pipenv cleanup. See merge request webgroup/fatcat!104 | ||||
| * | pipenv: re-lock project | Bryan Newbold | 2021-04-19 | 1 | -301/+253 |
| | | |||||
| * | pipenv: constrain most package versions to at least major | Bryan Newbold | 2021-04-19 | 1 | -24/+24 |
| | | | | | | | | | | | | | | | | | | Don't have a complete policy with this change, just locking things down a bit more so small package additions and updates don't end up upgrading some small dependency to a major new backwards-incompatible version. Also, correct bs4 -> beautifulsoup4 (bs4 is the import name, not the package name) | ||||
| * | pipenv: remove unused pg-view and pykafka libraries | Bryan Newbold | 2021-04-19 | 1 | -2/+0 |
| | | |||||
* | | web: fix edit form style guide links | Bryan Newbold | 2021-04-20 | 2 | -4/+4 |
|/ | |||||
* | transforms: fix 'display_ame' typo | Bryan Newbold | 2021-04-19 | 1 | -2/+2 |
| | |||||
* | web: expand release creators in more situations | Bryan Newbold | 2021-04-19 | 2 | -2/+2 |
| | |||||
* | fix public API links | Bryan Newbold | 2021-04-15 | 1 | -2/+2 |
| |
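
Several commits above ("clean_doi() should lower-case returned DOI", "more consistent and defensive lower-casing of DOIs") concern DOI case ambiguity. The following is a minimal sketch of defensive DOI normalization, not the project's actual `clean_doi()` implementation: the function name `normalize_doi`, the prefix list, and the validation pattern are all assumptions made for illustration.

```python
import re
from typing import Optional

# Hypothetical pattern (an assumption): "10.", a 3-6 digit registrant
# code, a slash, then a non-empty suffix without whitespace.
_DOI_PATTERN = re.compile(r"^10\.\d{3,6}/\S+$")

# Hypothetical list of common resolver/scheme prefixes to strip.
_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def normalize_doi(raw: Optional[str]) -> Optional[str]:
    """Return a lower-cased bare DOI, or None if the input does not look like one."""
    if not raw:
        return None
    # Lower-case first, so prefix matching and the returned value are
    # both case-insensitive with respect to the input.
    doi = raw.strip().lower()
    for prefix in _PREFIXES:
        if doi.startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not _DOI_PATTERN.match(doi):
        return None
    return doi
```

Lower-casing inside the helper itself, rather than at each call site, means callers that assume a normalized return value (as the Pubmed importer reportedly did) cannot reintroduce mixed-case identifiers.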