Weekly emailed summaries of Arχiv submissions

I like the daily emails that the Arχiv has for new submissions. However, submissions are abundant and most are not immediately relevant. There is a nice API for searching articles and downloading metadata, and I use this API to generate weekly summaries of submissions based on keyword filters, which I send to myself each Monday.

The code for this is on github.

What this does

The main input is a filter file, which specifies repositories, authors and search strings. It strictly matches repositories, and is more lenient with other search criteria.

example_filter.txt
--------------------
Anything before the first header (RePOSITORIES: etc) is ignored.
We can put an email here, for use in another script.

# Comment lines are ignored
REPOSITORIES:
math.gr
math.at

AUTHORS:
Terrence Tao
Leonhard Euler
# Hyphens and potentially other symbols should be substituted for underscores.
Jean_Pierre Serre

# These headers are case sensitive and must contain the colon
KEYWORDS:
artin
coxter
braid
maths is cool

The main script (ideally scheduled by Cron) does the following:

  • The python script pulls relevant articles using the API and makes a summary which is a .html file.
  • Mutt sends this .html file to my email. I use a Zoho mail to send these emails. Zoho provides an SMTP server, which plays nicely with Mutt.

These emails look like this:


Get in touch if you have any questions.