index
:
fatcat
bnewbold-doaj-article-harvest
bnewbold-elastic-extras
bnewbold-openapi-client-generator-v601
bnewbold-pythonclient-types
bnewbold-redoc
bnewbold-rust-gen-v5
bnewbold-sitemap
bnewbold-ubuntu-jammy
cockroach
confluent-kafka
master
preview
x-attic-auth-other-macaroon-lib
x-attic-camp
x-attic-changelog-export
x-attic-chocula
x-attic-cockroach
x-attic-golang
x-attic-more-importers
x-attic-preview
x-attic-python-rust-hacks
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
python
/
tests
/
files
Commit message (
Expand
)
Author
Age
Files
Lines
*
pubmed: handle multiple ReferenceList
Bryan Newbold
2020-03-20
1
-0
/
+206
*
Merge branch 'martin-kafka-bs4-import' into 'master'
Martin Czygan
2020-03-10
2
-0
/
+0
|
\
|
*
more pubmed adjustments
Martin Czygan
2020-02-22
2
-0
/
+0
*
|
Merge branch 'bnewbold-elastic-v03b'
Bryan Newbold
2020-02-26
3
-0
/
+3
|
\
\
|
*
|
fix some transform bugs, add some tests
Bryan Newbold
2020-01-29
3
-0
/
+3
*
|
|
shadow import: more filtering of file_meta fields
Bryan Newbold
2020-02-13
1
-12
/
+10
*
|
|
basic shadow importer
Bryan Newbold
2020-02-13
1
-0
/
+12
|
|
/
|
/
|
*
|
datacite: add exception for https://www.micropublication.org/
Martin Czygan
2020-01-31
1
-1
/
+2
*
|
datacite: improve date handling and minor tweak
Martin Czygan
2020-01-30
2
-0
/
+110
|
/
*
do not normalize "en dash" in DOI
Martin Czygan
2020-01-17
1
-1
/
+1
*
ingest: improve tests, support old ingest results
Bryan Newbold
2020-01-15
2
-1
/
+2
*
datacite: ignore known unknown values in resourceType*
Martin Czygan
2020-01-09
2
-0
/
+94
*
datacite: abstracts may be strings or list of strings
Martin Czygan
2020-01-09
4
-0
/
+186
*
datacite: improve license_slug handling
Martin Czygan
2020-01-09
2
-1
/
+3
*
datacite: add 'Unknown' to blacklist
Martin Czygan
2020-01-09
1
-7
/
+1
*
datacite: get rid of schemaVersion
Martin Czygan
2020-01-09
17
-32
/
+14
*
datacite: reformat test cases and use jq . --sort-keys
Martin Czygan
2020-01-08
54
-2299
/
+2301
*
datacite: factor out contributor handling
Martin Czygan
2020-01-08
4
-0
/
+105
*
datacite: adjust tests for release_month
Martin Czygan
2020-01-08
12
-12
/
+12
*
datacite: mark additional files as stub
Martin Czygan
2020-01-08
2
-0
/
+72
*
datacite: CCDC are entries, mostly
Martin Czygan
2020-01-08
1
-1
/
+1
*
datacite: adding datacite-specific extra metadata
Martin Czygan
2020-01-07
30
-1468
/
+1570
*
datacite: month field should be top-level
Martin Czygan
2020-01-06
11
-14
/
+14
*
datacite: include month in extra
Martin Czygan
2020-01-06
11
-11
/
+13
*
datacite: clean abstracts, use unknown value tokens
Martin Czygan
2020-01-06
3
-3
/
+3
*
datacite: always include "datacite" key in extra
Martin Czygan
2020-01-04
14
-26
/
+26
*
datacite: remove --lang-detect flag
Martin Czygan
2020-01-03
5
-10
/
+15
*
datacite: add another test case
Martin Czygan
2020-01-02
2
-0
/
+70
*
datacite: open case for editing after creation
Martin Czygan
2020-01-02
1
-0
/
+2
*
datacite: add helper script to create new test case
Martin Czygan
2020-01-02
1
-0
/
+14
*
datacite: address raw_name index form comment
Martin Czygan
2020-01-02
19
-111
/
+111
*
datacite: add conversion fixtures
Martin Czygan
2020-01-02
49
-0
/
+3924
*
improve datacite field mapping and import
Martin Czygan
2019-12-28
2
-0
/
+1
*
datacite: add simple test and fixture for datacite api interaction
Martin Czygan
2019-12-27
1
-0
/
+1
*
add regression test for medlinedate -> year parsing
Bryan Newbold
2019-12-23
1
-0
/
+95
*
add basic test for crossref harvest API call
Bryan Newbold
2019-12-06
1
-0
/
+1
*
ingest file result importer
Bryan Newbold
2019-11-15
1
-0
/
+1
*
release elasticsearch results: stage not status
Bryan Newbold
2019-06-13
1
-1
/
+1
*
JALC bulk file importer
Bryan Newbold
2019-05-21
1
-0
/
+100
*
basic JALC XML DOI metadata parser
Bryan Newbold
2019-05-21
1
-0
/
+176
*
basic JSTOR XML parser
Bryan Newbold
2019-05-21
1
-0
/
+58
*
basic arxivraw XML parser
Bryan Newbold
2019-05-21
1
-0
/
+31
*
basic pubmed parser
Bryan Newbold
2019-05-21
1
-0
/
+36822
*
fix releases/release_ids in math_universe.json test file
Bryan Newbold
2019-05-20
1
-1
/
+1
*
importer code updates
Bryan Newbold
2019-05-13
1
-1
/
+1
*
update example release JSON to new schema (ext_id, release_stage)
Bryan Newbold
2019-05-13
2
-11
/
+11
*
arabesque import tests
Bryan Newbold
2019-04-18
2
-0
/
+10
*
many web test improvements
Bryan Newbold
2019-04-04
2
-0
/
+2
*
more integration of transform refactor
Bryan Newbold
2019-03-11
1
-0
/
+10
*
crossref import tweaks/fixes
Bryan Newbold
2019-01-29
1
-0
/
+1
[next]