Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge branch 'bnewbold-xml-html-ingest' into 'master' | Martin Czygan | 2020-11-19 | 10 | -66/+409 |
|\ | | | | | | | | | HTML webcapture ingest (and XML file ingest) See merge request webgroup/fatcat!88 | ||||
| * | html ingest: actual xhtml mimetype | Bryan Newbold | 2020-11-16 | 1 | -2/+2 |
| | | |||||
| * | ingest tool: support for setting ingest type | Bryan Newbold | 2020-11-06 | 2 | -6/+10 |
| | | |||||
| * | html ingest: remaining implementation | Bryan Newbold | 2020-11-06 | 1 | -22/+19 |
| | | |||||
| * | ingest: fix XML ingest test file | Bryan Newbold | 2020-11-05 | 1 | -1/+1 |
| | | |||||
| * | ingest: progress on HTML ingest | Bryan Newbold | 2020-11-05 | 3 | -16/+74 |
| | | |||||
| * | ingest: initial 'web' worker implementation | Bryan Newbold | 2020-11-05 | 3 | -67/+301 |
| | | |||||
| * | refactor: white/black -> allow/block | Bryan Newbold | 2020-11-05 | 1 | -4/+4 |
| | | |||||
| * | ingest: whitelist -> allowlist | Bryan Newbold | 2020-11-05 | 2 | -6/+6 |
| | | |||||
| * | ingest: tests for basic XML ingest | Bryan Newbold | 2020-11-05 | 2 | -0/+18 |
| | | |||||
| * | ingest: basic checks for ingest_type | Bryan Newbold | 2020-11-05 | 3 | -4/+36 |
|/ | |||||
* | normalizer: filter out a specific non-ASCII character in DOI | Bryan Newbold | 2020-11-04 | 1 | -1/+3 |
| | |||||
* | entity updates: don't ingest JSTOR DOI prefixes | Bryan Newbold | 2020-10-23 | 1 | -0/+2 |
| | |||||
* | Merge branch 'bnewbold-scholar-pipeline' into 'master' | bnewbold | 2020-10-20 | 2 | -2/+26 |
|\ | | | | | | | | | entity updater: new work update feed (ident and changelog metadata only) See merge request webgroup/fatcat!87 | ||||
| * | entity updater: new work update feed (ident and changelog metadata only) | Bryan Newbold | 2020-10-16 | 2 | -2/+26 |
|/ | |||||
* | bulk citation graph workflow proposal | Bryan Newbold | 2020-10-15 | 1 | -0/+160 |
| | |||||
* | Merge branch 'bnewbold-web-tweaks-20201013' into 'master' | Martin Czygan | 2020-10-14 | 4 | -12/+19 |
|\ | | | | | | | | | web coverage tweaks 20201013 See merge request webgroup/fatcat!86 | ||||
| * | container coverage: add keeper link and KBART holdings list | Bryan Newbold | 2020-10-13 | 1 | -0/+11 |
| | | |||||
| * | release view: remove abiguous OA status indicator | Bryan Newbold | 2020-10-13 | 1 | -4/+0 |
| | | |||||
| * | container view: fix non-OA empty box | Bryan Newbold | 2020-10-13 | 1 | -3/+3 |
| | | |||||
| * | coverage: show counts and fraction in tooltip of coverage bars | Bryan Newbold | 2020-10-13 | 1 | -5/+5 |
|/ | |||||
* | update database/table stats | Bryan Newbold | 2020-10-12 | 2 | -0/+48 |
| | |||||
* | chocula importer: small tweaks to update behavior | Bryan Newbold | 2020-10-08 | 1 | -8/+6 |
| | |||||
* | elastic transform: more preservation keepers | Bryan Newbold | 2020-10-08 | 1 | -1/+2 |
| | |||||
* | update 'contributing' page in guide | Bryan Newbold | 2020-10-02 | 5 | -17/+176 |
| | |||||
* | update README | Bryan Newbold | 2020-10-01 | 1 | -41/+51 |
| | |||||
* | updates to CHANGELOG | Bryan Newbold | 2020-10-01 | 1 | -0/+19 |
| | |||||
* | more metadata cleanup task notes | Bryan Newbold | 2020-10-01 | 1 | -0/+7 |
| | |||||
* | Merge branch 'bnewbold-202009-polish' into 'master' | Martin Czygan | 2020-09-29 | 10 | -124/+159 |
|\ | | | | | | | | | fatcat.wiki 2020-09 polish See merge request webgroup/fatcat!84 | ||||
| * | coverage: handle the case of hits, but none with years | Bryan Newbold | 2020-09-17 | 1 | -4/+5 |
| | | |||||
| * | web: handle unknown CSL style as a cleaner 400 page | Bryan Newbold | 2020-09-17 | 2 | -1/+7 |
| | | |||||
| * | web: update sub-resource integrity and pre-loading | Bryan Newbold | 2020-09-17 | 1 | -0/+13 |
| | | | | | | | | For security/integrity and performance | ||||
| * | lint cleanups | Bryan Newbold | 2020-09-17 | 2 | -3/+0 |
| | | |||||
| * | web: route constraints on fcids and UUIDs | Bryan Newbold | 2020-09-17 | 2 | -101/+103 |
| | | | | | | | | | | | | | | | | | | | | | | Instead of accepting any string for these parameters and throwing a 400 error if not the correct type, implement better route matching at the framework level and return more 404s. This resolves several outstanding sentry exceptions. The "flask-uuid" was imported and seems to have been configured for this purpose previously, but I guess I never finished configuring it. | ||||
| * | container view: only show OA indicator when known | Bryan Newbold | 2020-09-17 | 1 | -5/+1 |
| | | | | | | | | | | The "is_oa:False" could be that we just don't know; aren't actually distinguishing between false and blank. | ||||
| * | web container view: hide preservation when no releases | Bryan Newbold | 2020-09-17 | 1 | -8/+6 |
| | | |||||
| * | web toml editing: remove sub-entities from TOML | Bryan Newbold | 2020-09-17 | 1 | -0/+4 |
| | | |||||
| * | coverage search: pretty display for ES query errors | Bryan Newbold | 2020-09-17 | 2 | -1/+19 |
| | | |||||
| * | coverage: clarify available/accessible terminology | Bryan Newbold | 2020-09-17 | 1 | -1/+1 |
| | | |||||
* | | update keepers links to keepers.issn.org | Bryan Newbold | 2020-09-28 | 2 | -8/+8 |
| | | |||||
* | | Merge branch 'martin-datacite-spammy-title' into 'master' | Martin Czygan | 2020-09-22 | 2 | -0/+25 |
|\ \ | |/ |/| | | | | | address spammy datacite titles See merge request webgroup/fatcat!85 | ||||
| * | address spammy datacite titles | Martin Czygan | 2020-09-23 | 2 | -0/+25 |
|/ | | | | | | | | | seemingly from zenodo: * https://fatcat.wiki/release/rzcpjwukobd4pj36ipla22cnoi * https://doi.org/10.5281/zenodo.4041777 About 3400 records with "FULL MOVIE" in title, currently. | ||||
* | homepage: small grammar tweaks (The/the) | Bryan Newbold | 2020-09-11 | 1 | -3/+3 |
| | |||||
* | ingest: default to crawl protocols.io DOIs | Bryan Newbold | 2020-09-10 | 1 | -0/+2 |
| | |||||
* | Merge branch 'bnewbold-datacite-not-empty-version' into 'master' | bnewbold | 2020-09-11 | 3 | -2/+3 |
|\ | | | | | | | | | datacite: handle case of empty-string version See merge request webgroup/fatcat!83 | ||||
| * | datacite: handle case of empty-string version | Bryan Newbold | 2020-09-10 | 3 | -2/+3 |
|/ | | | | | Includes a tiny tweak to the datacite import sample file to test this code path. | ||||
* | file_meta import notes | Bryan Newbold | 2020-09-04 | 1 | -0/+75 |
| | |||||
* | update stats snapshot | Bryan Newbold | 2020-09-03 | 2 | -0/+47 |
| | |||||
* | remove spurious print statement | Bryan Newbold | 2020-09-03 | 1 | -1/+0 |
| | |||||
* | Merge branch 'bnewbold-file-meta-cleanups' into 'master' | Martin Czygan | 2020-09-03 | 3 | -0/+149 |
|\ | | | | | | | | | generic file entity clean-ups as part of file_meta importer See merge request webgroup/fatcat!82 |