path: root/python
Commit message  [Author, Date, Files changed, Lines -removed/+added]
* more python normalizers, and move from importer common  [Bryan Newbold, 2020-11-19, 2 files, -154/+326]
* initial implementation of DOAJ importer  [Bryan Newbold, 2020-11-19, 4 files, -0/+387]
* html ingest: actual xhtml mimetype  [Bryan Newbold, 2020-11-16, 1 file, -2/+2]
* ingest tool: support for setting ingest type  [Bryan Newbold, 2020-11-06, 2 files, -6/+10]
* html ingest: remaining implementation  [Bryan Newbold, 2020-11-06, 1 file, -22/+19]
* ingest: fix XML ingest test file  [Bryan Newbold, 2020-11-05, 1 file, -1/+1]
* ingest: progress on HTML ingest  [Bryan Newbold, 2020-11-05, 3 files, -16/+74]
* ingest: initial 'web' worker implementation  [Bryan Newbold, 2020-11-05, 3 files, -67/+301]
* refactor: white/black -> allow/block  [Bryan Newbold, 2020-11-05, 1 file, -4/+4]
* ingest: whitelist -> allowlist  [Bryan Newbold, 2020-11-05, 2 files, -6/+6]
* ingest: tests for basic XML ingest  [Bryan Newbold, 2020-11-05, 2 files, -0/+18]
* ingest: basic checks for ingest_type  [Bryan Newbold, 2020-11-05, 3 files, -4/+36]
* normalizer: filter out a specific non-ASCII character in DOI  [Bryan Newbold, 2020-11-04, 1 file, -1/+3]
* entity updates: don't ingest JSTOR DOI prefixes  [Bryan Newbold, 2020-10-23, 1 file, -0/+2]
* entity updater: new work update feed (ident and changelog metadata only)  [Bryan Newbold, 2020-10-16, 2 files, -2/+26]
* container coverage: add keeper link and KBART holdings list  [Bryan Newbold, 2020-10-13, 1 file, -0/+11]
* release view: remove abiguous OA status indicator  [Bryan Newbold, 2020-10-13, 1 file, -4/+0]
* container view: fix non-OA empty box  [Bryan Newbold, 2020-10-13, 1 file, -3/+3]
* coverage: show counts and fraction in tooltip of coverage bars  [Bryan Newbold, 2020-10-13, 1 file, -5/+5]
* chocula importer: small tweaks to update behavior  [Bryan Newbold, 2020-10-08, 1 file, -8/+6]
* elastic transform: more preservation keepers  [Bryan Newbold, 2020-10-08, 1 file, -1/+2]
* Merge branch 'bnewbold-202009-polish' into 'master'  [Martin Czygan, 2020-09-29, 10 files, -124/+159]
|\
| * coverage: handle the case of hits, but none with years  [Bryan Newbold, 2020-09-17, 1 file, -4/+5]
| * web: handle unknown CSL style as a cleaner 400 page  [Bryan Newbold, 2020-09-17, 2 files, -1/+7]
| * web: update sub-resource integrity and pre-loading  [Bryan Newbold, 2020-09-17, 1 file, -0/+13]
| * lint cleanups  [Bryan Newbold, 2020-09-17, 2 files, -3/+0]
| * web: route constraints on fcids and UUIDs  [Bryan Newbold, 2020-09-17, 2 files, -101/+103]
| * container view: only show OA indicator when known  [Bryan Newbold, 2020-09-17, 1 file, -5/+1]
| * web container view: hide preservation when no releases  [Bryan Newbold, 2020-09-17, 1 file, -8/+6]
| * web toml editing: remove sub-entities from TOML  [Bryan Newbold, 2020-09-17, 1 file, -0/+4]
| * coverage search: pretty display for ES query errors  [Bryan Newbold, 2020-09-17, 2 files, -1/+19]
| * coverage: clarify available/accessible terminology  [Bryan Newbold, 2020-09-17, 1 file, -1/+1]
* | update keepers links to keepers.issn.org  [Bryan Newbold, 2020-09-28, 2 files, -8/+8]
* | address spammy datacite titles  [Martin Czygan, 2020-09-23, 2 files, -0/+25]
|/
* homepage: small grammar tweaks (The/the)  [Bryan Newbold, 2020-09-11, 1 file, -3/+3]
* ingest: default to crawl protocols.io DOIs  [Bryan Newbold, 2020-09-10, 1 file, -0/+2]
* datacite: handle case of empty-string version  [Bryan Newbold, 2020-09-10, 3 files, -2/+3]
* remove spurious print statement  [Bryan Newbold, 2020-09-03, 1 file, -1/+0]
* generic file entity clean-ups as part of file_meta importer  [Bryan Newbold, 2020-09-02, 3 files, -0/+149]
* Merge branch 'bnewbold-filemeta'  [Bryan Newbold, 2020-08-27, 5 files, -0/+162]
|\
| * fix comment typo (thanks martin)  [Bryan Newbold, 2020-08-27, 1 file, -1/+1]
| * fixes and test coverage for file_meta importer  [Bryan Newbold, 2020-08-21, 4 files, -6/+82]
| * initial implementation of file_meta importer  [Bryan Newbold, 2020-08-21, 3 files, -0/+86]
* | remove typo (isbn:) from metadata DC.language field  [Bryan Newbold, 2020-08-21, 1 file, -1/+1]
* | remove placeholder description meta tag  [Bryan Newbold, 2020-08-20, 1 file, -1/+0]
|/
* fix SearchAction nesting in WebSite (schema.org)  [Bryan Newbold, 2020-08-20, 1 file, -5/+2]
* sitemap fixes from testing  [Bryan Newbold, 2020-08-19, 1 file, -5/+5]
* update robots.txt and sitemap.xml  [Bryan Newbold, 2020-08-19, 4 files, -2/+52]
* entity updater: handle doi=None case better  [Bryan Newbold, 2020-08-14, 1 file, -1/+1]
* entity updater: es['publisher_type'] not always set  [Bryan Newbold, 2020-08-14, 1 file, -1/+1]