chocula (branch: master)
| Commit message | Author | Date | Files | Lines |
|---|---|---|---|---|
| update TODO | Bryan Newbold | 2020-10-08 | 1 | -8/+7 |
| sources: update metadata snapshot | Bryan Newbold | 2020-10-08 | 1 | -2/+2 |
| estimate coverage change from new coverage holdings | Bryan Newbold | 2020-10-08 | 1 | -0/+22 |
| database support for scholarsportal and cariniana preservation holdings | Bryan Newbold | 2020-10-08 | 7 | -1/+198 |
| make: entrez.txt, not entrez.csv | Bryan Newbold | 2020-10-08 | 1 | -2/+2 |
| vanished_inactive: more tolerant handling of unicode BOM | Bryan Newbold | 2020-10-08 | 1 | -1/+2 |
| basic ONIX XML-to-JSON converter | Bryan Newbold | 2020-10-08 | 1 | -0/+151 |
| fix typo in sources | Bryan Newbold | 2020-10-08 | 1 | -1/+1 |
| util: parse ISSN format with extra spaces | Bryan Newbold | 2020-09-13 | 1 | -0/+2 |
| update vanished journal importer for 2020-09-03 dataset | Bryan Newbold | 2020-09-13 | 6 | -82/+92 |
| update notes and explore | Bryan Newbold | 2020-09-03 | 2 | -1/+26 |
| notes on hathitrust importer | Bryan Newbold | 2020-09-02 | 1 | -0/+58 |
| update sources (dates) | Bryan Newbold | 2020-09-02 | 1 | -4/+4 |
| do not create hathitrust-only journal rows | Bryan Newbold | 2020-09-02 | 1 | -1/+2 |
| hathitrust KBART-style importer | Bryan Newbold | 2020-09-02 | 7 | -2/+152 |
| commit notes on size/scale of OJS ecosystem | Bryan Newbold | 2020-08-31 | 1 | -0/+8 |
| include pkp_pln as a kbart directory in summarization/export/etc | Bryan Newbold | 2020-08-31 | 1 | -1/+1 |
| notes on PKP PLN addition | Bryan Newbold | 2020-08-31 | 1 | -0/+13 |
| fmt | Bryan Newbold | 2020-08-31 | 3 | -12/+29 |
| add makefile/sources support for PKP PLN | Bryan Newbold | 2020-08-31 | 2 | -2/+11 |
| add support for PKP PLN (KBART-like) | Bryan Newbold | 2020-08-31 | 5 | -1/+139 |
| fix img typo | Bryan Newbold | 2020-08-19 | 1 | -1/+1 |
| bump sources date | Bryan Newbold | 2020-08-03 | 1 | -2/+2 |
| fatcat export improvements | Bryan Newbold | 2020-08-03 | 1 | -9/+28 |
| more blocked URLs and domains | Bryan Newbold | 2020-08-03 | 1 | -0/+29 |
| directories: all extra metadata in top-level dict | Bryan Newbold | 2020-08-03 | 4 | -13/+9 |
| sim: some flag fields as boolean | Bryan Newbold | 2020-08-03 | 1 | -2/+12 |
| doaj bug: wasn't setting extra directory metadata | Bryan Newbold | 2020-08-03 | 1 | -9/+8 |
| brief not on how many remaining missing longtail homepages | Bryan Newbold | 2020-07-08 | 1 | -0/+3 |
| sources: automated updates, plus container+homepage stats/status | Bryan Newbold | 2020-07-08 | 1 | -4/+4 |
| update reports | Bryan Newbold | 2020-07-08 | 2 | -6/+1245 |
| remove trailing whitespace from comment | Bryan Newbold | 2020-06-25 | 1 | -7/+7 |
| small improvements to check URL script | Bryan Newbold | 2020-06-25 | 1 | -2/+2 |
| improvements to Makefile stats/status commands | Bryan Newbold | 2020-06-25 | 1 | -2/+2 |
| update TODO | Bryan Newbold | 2020-06-23 | 1 | -21/+15 |
| update notes about longtail homepage URLs | Bryan Newbold | 2020-06-23 | 2 | -3/+112 |
| updated report HTML | Bryan Newbold | 2020-06-23 | 1 | -0/+1172 |
| add MAG importer; reorder directory class listing | Bryan Newbold | 2020-06-23 | 5 | -10/+110 |
| block some meta strings | Bryan Newbold | 2020-06-23 | 1 | -0/+3 |
| skip umi.com in addition to www.umi.com | Bryan Newbold | 2020-06-23 | 1 | -0/+1 |
| commit notes and issnl_prefix.py helper script | Bryan Newbold | 2020-06-23 | 4 | -0/+157 |
| road: proper language parsing | Bryan Newbold | 2020-06-23 | 1 | -2/+6 |
| ensure lang is len()==2; prep for original_name column | Bryan Newbold | 2020-06-23 | 1 | -0/+5 |
| make fmt | Bryan Newbold | 2020-06-23 | 1 | -34/+39 |
| update sources snapshot | Bryan Newbold | 2020-06-23 | 1 | -2/+2 |
| flake8: ignore comment w/o space | Bryan Newbold | 2020-06-23 | 1 | -1/+1 |
| expand test coverage to kbart, summarize | Bryan Newbold | 2020-06-23 | 5 | -49/+102 |
| tests and fixes for parse_lang(), parse_country() | Bryan Newbold | 2020-06-23 | 1 | -19/+78 |
| block/skip more homepage patterns | Bryan Newbold | 2020-06-23 | 1 | -0/+9 |
| fix langs inclusion in summarization; remove unused/duplicate fields | Bryan Newbold | 2020-06-23 | 1 | -2/+2 |