arxiv articles by identifier

Log in to Vote
22
22 Votes • 6 Comments
A search of the form "arxiv:1010.2163v1" or "arxiv:quant-ph/0508054" should return the title of the relevant arxiv article along with links to the abstract and pdf. There is an API available (e.g. http://export.arxiv.org/api/query?id_lis...) but it returns XML rather than JSON. Note that the forms I mentioned, with the word "arxiv" then a colon then the identifier, are a somewhat standard way of referring to arxiv articles.
• posted 6 years and 4 months ago • type: Spice (API calls) Needs a Developer

anonymous
I would like to see the full arxiv article text indexed, not just the identifiers, so that searches for key-words or article titles or sentences from abstracts return links to arXiv articles. Currently duck duck go appears to have no index of arXiv at all. Google does. Crawling arXiv must be arranged with arXiv in advance or your IP addresses will be banned.
posted by [UserVoice Anonymous] • 6 years and 1 month ago Link
anonymous
One more thing: sometimes arxiv is spelled "arXiv" (which is in fact the proper spelling). So the identifiers could look like "arXiv:1010.2163".
posted by <hidden> • 6 years and 4 months ago Link
anonymous
I explored this idea recently and looked into arxiv's API. They only have an XML endpoint though so I built a wrapper http://www.arxivwrap.org which converts their data to JSON. As I understand it the DDG policy is to not use that kind of thing (a wrapper around a public endpoint) and I understand all the reasons why. So I tried contacting the arxiv API maintainers through the google group but have yet to hear back from them (this was nearly a month ago).

Any thoughts on how to proceed?
posted by <hidden> • 3 years and 4 months ago Link
Jag
We can use this endpoint instead of using arxivwrap.org: https://github.com/duckduckgo/zeroclicki...
posted by Jag Staff2 years and 10 months ago Link
anonymous
hi Jag, not sure I follow, do you mind explaining a bit more? What does that URL do?
posted by <hidden> • 2 years and 10 months ago Link
Moollaza
Sorry for the long delay. I've spoken to the team about permitting XML and we think this is a great test project. Are you still interested in building a Spice for this? If so please contact me, moollaza@duckduckgo.com to discuss this.

I'm also on Slack, and happy to chat there. You can join slack by sending an email to quackslack@duckduckgo.com
posted by Moollaza Staff2 years and 9 months ago Link