<feed xmlns='http://www.w3.org/2005/Atom'>
<title>fatcat-scholar/schema, branch x-attic-rescore</title>
<subtitle>Unnamed repository; edit this file 'description' to name the repository.
</subtitle>
<id>https://git.bnewbold.net/fatcat-scholar/atom?h=x-attic-rescore</id>
<link rel='self' href='https://git.bnewbold.net/fatcat-scholar/atom?h=x-attic-rescore'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/'/>
<updated>2020-08-07T02:01:12+00:00</updated>
<entry>
<title>ES schema: access_type should be any option, not just 'best'</title>
<updated>2020-08-07T02:01:12+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-08-06T23:10:19+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=dd96b7906e417dd74081e508aaabfebaeac5ece1'/>
<id>urn:sha1:dd96b7906e417dd74081e508aaabfebaeac5ece1</id>
<content type='text'>
</content>
</entry>
<entry>
<title>enable index_phrases on everything, biblio_all, title_all</title>
<updated>2020-08-06T19:33:01+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-08-06T19:32:59+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=b33f69d92dbdfbf1be15521bf1893c053765a580'/>
<id>urn:sha1:b33f69d92dbdfbf1be15521bf1893c053765a580</id>
<content type='text'>
Want phrase queries to be faster. Expect this to increase term index
size, requiring more disk space.
</content>
</entry>
<entry>
<title>ES schema: do not index fulltext.body or fulltext.annex separately from 'everything'</title>
<updated>2020-08-06T19:32:09+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-08-06T19:31:53+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=8611d4cfd348b57120f936c064e9591c419a7ace'/>
<id>urn:sha1:8611d4cfd348b57120f936c064e9591c419a7ace</id>
<content type='text'>
The goal here is to reduce term index size. This means that
querying/matching only on these fields (distinct from "everything") will
not work.
</content>
</entry>
<entry>
<title>ES schema: use smaller integer size (short) for most numbers</title>
<updated>2020-08-06T19:31:10+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-08-06T19:31:10+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=8b5a78cd13ebfe5843bb0f04839afde69e09bb59'/>
<id>urn:sha1:8b5a78cd13ebfe5843bb0f04839afde69e09bb59</id>
<content type='text'>
</content>
</entry>
<entry>
<title>ES schema: copy_to titles into single title_all field</title>
<updated>2020-08-06T19:30:44+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-08-06T19:30:44+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=da2e89152d882ee0ace9878c7ceba2567f8b01a1'/>
<id>urn:sha1:da2e89152d882ee0ace9878c7ceba2567f8b01a1</id>
<content type='text'>
</content>
</entry>
<entry>
<title>schema: 12 shards, 0 replicas, more compression</title>
<updated>2020-07-27T22:53:51+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-07-27T22:53:51+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=0c7a2ace5d7c5b357dd4afa708a07e3fa85849fd'/>
<id>urn:sha1:0c7a2ace5d7c5b357dd4afa708a07e3fa85849fd</id>
<content type='text'>
</content>
</entry>
<entry>
<title>schema: access as object (list), not nested</title>
<updated>2020-07-21T20:47:11+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-07-21T20:47:07+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=f6db34857ea6e09d60f9d085cead3f045d171b84'/>
<id>urn:sha1:f6db34857ea6e09d60f9d085cead3f045d171b84</id>
<content type='text'>
Nested allows more precise filter queries, but it seems that simple "dot
notation" filters/queries don't work. We don't have anything doing the
sophisticated queries yet, so keep it simple.
</content>
</entry>
<entry>
<title>'tag' alias for 'tags'</title>
<updated>2020-06-04T20:41:13+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-06-04T20:41:13+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=264dadb59b046794880121eaad7f0f8082568cd1'/>
<id>urn:sha1:264dadb59b046794880121eaad7f0f8082568cd1</id>
<content type='text'>
</content>
</entry>
<entry>
<title>collapse pages by SIM issue</title>
<updated>2020-06-04T20:18:35+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-06-04T20:18:35+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=198db52d3a93a2b7d7cab0a4140c6402a14eca84'/>
<id>urn:sha1:198db52d3a93a2b7d7cab0a4140c6402a14eca84</id>
<content type='text'>
</content>
</entry>
<entry>
<title>HTML strip in ES indexing</title>
<updated>2020-05-22T03:23:45+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-05-22T03:23:45+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat-scholar/commit/?id=fee17cf6518e13b6f1c3945dd769aba56d7606d5'/>
<id>urn:sha1:fee17cf6518e13b6f1c3945dd769aba56d7606d5</id>
<content type='text'>
</content>
</entry>
</feed>
