Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | make: entrez.txt, not entrez.csv | Bryan Newbold | 2020-10-08 | 1 | -2/+2 | |
| | ||||||
* | vanished_inactive: more tolerant handling of unicode BOM | Bryan Newbold | 2020-10-08 | 1 | -1/+2 | |
| | ||||||
* | basic ONIX XML-to-JSON converter | Bryan Newbold | 2020-10-08 | 1 | -0/+151 | |
| | ||||||
* | fix typo in sources | Bryan Newbold | 2020-10-08 | 1 | -1/+1 | |
| | ||||||
* | util: parse ISSN format with extra spaces | Bryan Newbold | 2020-09-13 | 1 | -0/+2 | |
| | ||||||
* | update vanished journal importer for 2020-09-03 dataset | Bryan Newbold | 2020-09-13 | 6 | -82/+92 | |
| | ||||||
* | update notes and explore | Bryan Newbold | 2020-09-03 | 2 | -1/+26 | |
| | ||||||
* | notes on hathitrust importer | Bryan Newbold | 2020-09-02 | 1 | -0/+58 | |
| | ||||||
* | update sources (dates) | Bryan Newbold | 2020-09-02 | 1 | -4/+4 | |
| | ||||||
* | do not create hathitrust-only journal rows | Bryan Newbold | 2020-09-02 | 1 | -1/+2 | |
| | ||||||
* | hathitrust KBART-style importer | Bryan Newbold | 2020-09-02 | 7 | -2/+152 | |
| | ||||||
* | commit notes on size/scale of OJS ecosystem | Bryan Newbold | 2020-08-31 | 1 | -0/+8 | |
| | ||||||
* | include pkp_pln as a kbart directory in summarization/export/etc | Bryan Newbold | 2020-08-31 | 1 | -1/+1 | |
| | ||||||
* | notes on PKP PLN addition | Bryan Newbold | 2020-08-31 | 1 | -0/+13 | |
| | ||||||
* | fmt | Bryan Newbold | 2020-08-31 | 3 | -12/+29 | |
| | ||||||
* | add makefile/sources support for PKP PLN | Bryan Newbold | 2020-08-31 | 2 | -2/+11 | |
| | | | | Also more accurate JSTOR URL in sources.toml | |||||
* | add support for PKP PLN (KBART-like) | Bryan Newbold | 2020-08-31 | 5 | -1/+139 | |
| | ||||||
* | fix img typo | Bryan Newbold | 2020-08-19 | 1 | -1/+1 | |
| | ||||||
* | bump sources date | Bryan Newbold | 2020-08-03 | 1 | -2/+2 | |
| | ||||||
* | fatcat export improvements | Bryan Newbold | 2020-08-03 | 1 | -9/+28 | |
| | ||||||
* | more blocked URLs and domains | Bryan Newbold | 2020-08-03 | 1 | -0/+29 | |
| | ||||||
* | directories: all extra metadata in top-level dict | Bryan Newbold | 2020-08-03 | 4 | -13/+9 | |
| | | | | Had been using slug-specific sub-objects, but this was too confusing. | |||||
* | sim: some flag fields as boolean | Bryan Newbold | 2020-08-03 | 1 | -2/+12 | |
| | ||||||
* | doaj bug: wasn't setting extra directory metadata | Bryan Newbold | 2020-08-03 | 1 | -9/+8 | |
| | ||||||
* | brief not on how many remaining missing longtail homepages | Bryan Newbold | 2020-07-08 | 1 | -0/+3 | |
| | ||||||
* | sources: automated updates, plus container+homepage stats/status | Bryan Newbold | 2020-07-08 | 1 | -4/+4 | |
| | ||||||
* | update reports | Bryan Newbold | 2020-07-08 | 2 | -6/+1245 | |
| | ||||||
* | remove trailing whitespace from comment | Bryan Newbold | 2020-06-25 | 1 | -7/+7 | |
| | ||||||
* | small improvements to check URL script | Bryan Newbold | 2020-06-25 | 1 | -2/+2 | |
| | ||||||
* | improvements to Makefile stats/status commands | Bryan Newbold | 2020-06-25 | 1 | -2/+2 | |
| | ||||||
* | update TODO | Bryan Newbold | 2020-06-23 | 1 | -21/+15 | |
| | ||||||
* | update notes about longtail homepage URLs | Bryan Newbold | 2020-06-23 | 2 | -3/+112 | |
| | ||||||
* | updated report HTML | Bryan Newbold | 2020-06-23 | 1 | -0/+1172 | |
| | ||||||
* | add MAG importer; reorder directory class listing | Bryan Newbold | 2020-06-23 | 5 | -10/+110 | |
| | ||||||
* | block some meta strings | Bryan Newbold | 2020-06-23 | 1 | -0/+3 | |
| | ||||||
* | skip umi.com in addition to www.umi.com | Bryan Newbold | 2020-06-23 | 1 | -0/+1 | |
| | ||||||
* | commit notes and issnl_prefix.py helper script | Bryan Newbold | 2020-06-23 | 4 | -0/+157 | |
| | ||||||
* | road: proper language parsing | Bryan Newbold | 2020-06-23 | 1 | -2/+6 | |
| | ||||||
* | ensure lang is len()==2; prep for original_name column | Bryan Newbold | 2020-06-23 | 1 | -0/+5 | |
| | ||||||
* | make fmt | Bryan Newbold | 2020-06-23 | 1 | -34/+39 | |
| | ||||||
* | update sources snapshot | Bryan Newbold | 2020-06-23 | 1 | -2/+2 | |
| | ||||||
* | flake8: ignore comment w/o space | Bryan Newbold | 2020-06-23 | 1 | -1/+1 | |
| | ||||||
* | expand test coverage to kbart, summarize | Bryan Newbold | 2020-06-23 | 5 | -49/+102 | |
| | ||||||
* | tests and fixes for parse_lang(), parse_country() | Bryan Newbold | 2020-06-23 | 1 | -19/+78 | |
| | | | | These were basically entirely broken. Oof! | |||||
* | block/skip more homepage patterns | Bryan Newbold | 2020-06-23 | 1 | -0/+9 | |
| | ||||||
* | fix langs inclusion in summarization; remove unused/duplicate fields | Bryan Newbold | 2020-06-23 | 1 | -2/+2 | |
| | ||||||
* | strip control characters from titles (issn_meta) | Bryan Newbold | 2020-06-23 | 1 | -0/+4 | |
| | ||||||
* | fix issn_meta country detection | Bryan Newbold | 2020-06-23 | 1 | -5/+8 | |
| | ||||||
* | improve lang parsing | Bryan Newbold | 2020-06-23 | 5 | -7/+11 | |
| | ||||||
* | issn_meta: mainTitle can be an array | Bryan Newbold | 2020-06-23 | 1 | -1/+4 | |
| |