index
:
sandcrawler
bnewbold-args
bnewbold-backfill
bnewbold-persist-grobid-errors
bnewbold-refactor-loggging
master
trawler
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
scalding
/
src
/
main
/
scala
/
sandcrawler
/
GrobidScorable.scala
Commit message (
Expand
)
Author
Age
Files
Lines
*
clean up commented out code in scalding/
Bryan Newbold
2018-08-24
1
-3
/
+2
*
author parsing (and year, for crossref)
Bryan Newbold
2018-08-23
1
-1
/
+13
*
Added title length filtering to GrobidScorable
Ellen Spertus
2018-08-22
1
-0
/
+16
*
use grobid0:metadata, not tei_json
Bryan Newbold
2018-08-21
1
-5
/
+5
*
Created static factory method for ScorableCreations to deal with null.
Ellen Spertus
2018-08-20
1
-1
/
+1
*
handle null status_code lines
Bryan Newbold
2018-08-15
1
-0
/
+1
*
grobid scoring: status_code as signed int, not string
Bryan Newbold
2018-08-15
1
-2
/
+7
*
Now ignores grobid entries with status other than 200.
Ellen Spertus
2018-08-14
1
-3
/
+7
*
Factored out ScorableFeatures.
Ellen Spertus
2018-08-13
1
-5
/
+1
*
Pipeline works, all tests pass, no scalastyle errors.
Ellen Spertus
2018-08-13
1
-2
/
+1
*
It compiles.
Ellen Spertus
2018-08-11
1
-11
/
+10
*
It compiles
Ellen Spertus
2018-08-10
1
-3
/
+4
*
Broken code to share with Bryan.
Ellen Spertus
2018-08-09
1
-1
/
+1
*
WIP
Ellen Spertus
2018-08-09
1
-2
/
+3
*
WIP
Ellen Spertus
2018-08-09
1
-4
/
+5
*
Removed implicit parameters. Does not compile.
Ellen Spertus
2018-08-09
1
-1
/
+1
*
WIP
Ellen Spertus
2018-08-09
1
-8
/
+7
*
Fixed scalastyle violations.
Ellen Spertus
2018-08-09
1
-12
/
+9
*
Removed HBaseCrossrefScore{Job,Test} and references thereto.
Ellen Spertus
2018-08-07
1
-3
/
+5
*
Added GrobidScorableTest, minor improvements.
Ellen Spertus
2018-08-07
1
-9
/
+15
*
Added CrossrefScorable.scala. All code compiles.
Ellen Spertus
2018-08-07
1
-8
/
+5
*
New code compiles. Old tests pass. New tests not yet written.
Ellen Spertus
2018-08-06
1
-0
/
+48