"Estimating the number of unseen species: A bird in the hand is worth log(n) in the bush" https://arxiv.org/abs/1511.07428 https://www.pnas.org/content/113/47/13283
It deals with the classic, and wonderful, question of "If I go and catch 100 birds, and they're from 20 different species, how many species are left uncaught?" There's more one can say about that than it might first appear and it has plenty of applications. But mostly I just love the name. Apparently PNAS had them change it for the final publication, sadly.
https://www.cambridge.org/core/services/aop-cambridge-core/c...
I've always hated build systems. Stuff cobbled together that barely works, yet a necessary step towards working software. This paper showed me there's hope. If we take build systems seriously, we can come up with something much better than most systems out there.
Non-invasive early detection of cancer four years before conventional diagnosis using a blood test
https://www.nature.com/articles/s41467-020-17316-z
Major breakthrough in cell-free diagnostics. The methylation pattern of DNA can be used to identify early-stage cancer, i.e. circulating tumor DNA (ctDNA) has a distinct methylation pattern.
The results are based on data from a ten-year study, which must have cost a fortune to run.
1. Attention is not Explanation (https://arxiv.org/abs/1902.10186)
2. Attention is not not Explanation (https://arxiv.org/abs/1908.04626)
Goes to show the complete lack of agreement among researchers in the explainability space. Most popular packages (AllenNLP, Google LIT, Captum) use saliency-based methods (e.g. integrated gradients) or attention. The community has fundamental disagreements over whether these capture anything equivalent to importance as humans would understand it.
An entire community of fairness, ethics, and computational social science research is built on top of conclusions drawn using these methods. It is a shame that so much money is poured into these fields, yet there does not seem to be as strong a thrust to explore the most fundamental questions themselves.
(my 2 cents: I like SHAP and the stuff coming out of Bin Yu's and Tommi Jaakkola's labs better... but my opinion too is based on intuition without any real rigor)
There are lots of good bits, such as: 'On the practical side, not only is there no age at which humans are performing at peak on all cognitive tasks, there may not be an age at which humans perform at peak on most cognitive tasks. Studies that compare the young or elderly to “normal adults” must carefully select the “normal” population.' (italics in original)
This seems to me to comport with the research suggesting that most or all of the variance in IQ across the life span can be accounted for by controlling for mental processing speed; i.e., you are generally faster when you are younger, but you are not more correct when you are younger.
The idea that you can achieve the same practical effect as a 3x replication factor in a distributed system while increasing the cost of data storage by only 1.6x, by leveraging some clever information theory tricks, is mind-bending to me.
If you're operating a large Ceph cluster, or you're Google/Amazon/Microsoft running GCS/S3/ABS, and you needed 50PB of HDDs before, you only need 27PB now (if implementing this).
The cost savings and environmental impact reduction this allows for are truly enormous; I'm surprised how little attention this paper has gotten in the wild.
[0] https://www.microsoft.com/en-us/research/wp-content/uploads/...
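The arithmetic behind those numbers is worth making explicit. A minimal sketch, with illustrative (k, m) parameters rather than the paper's exact code construction:

```python
def storage_overhead(k: int, m: int) -> float:
    """Raw bytes stored per byte of user data for a (k, m) erasure code:
    k data fragments plus m parity fragments, tolerating any m losses."""
    return (k + m) / k

# 3-way replication is the degenerate code with 1 data + 2 parity copies.
replication = storage_overhead(1, 2)      # 3.0x, tolerates 2 losses

# A wide code such as 10 data + 6 parity fragments (illustrative numbers,
# not necessarily the paper's scheme) also tolerates up to 6 losses:
erasure = storage_overhead(10, 6)         # 1.6x

# The 50PB -> 27PB figure from the comment above:
user_data = 50 / replication              # ~16.7PB of actual user data
needed_now = user_data * erasure          # ~26.7PB of raw disk
print(f"{replication:.1f}x vs {erasure:.1f}x: {needed_now:.1f}PB of raw disk")
```

The trade-off the clever coding theory buys back is reconstruction cost: rebuilding a lost fragment touches many surviving fragments instead of one replica.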
And the earlier paper “A Polymorphic Type System for Extensible Records and Variants” https://web.cecs.pdx.edu/~mpj/pubs/96-3.pdf
Row types are magically good: they serve either records or variants (aka sum types aka enums) equally well and both polymorphically. They’re duals. Here’s a diagram.
               Construction                  Inspection
  Records      {x:1} : {x:Int}               r.x — r : {x:Int|r}
               [closed]                      [open; note the row variable r]
  Variants     ‘Just 1 : ...                 case v of ‘Just 0 -> ...
               [open; note the row var v]    [closed]
Neither has to be declared ahead of time, making them a perfect fit for the balance between play and serious work in my programming language.
https://arxiv.org/abs/1706.03762
It's from 2017 but I first read it this year. This is the paper that defined the "transformer" architecture for deep neural nets. Over the past few years, transformers have become a more and more common architecture, most notably with GPT-3 but also in other domains besides text generation. The fundamental principle behind the transformer is attention: the model relates every pair of positions in a length-n input, so the computation scales as O(n^2) in sequence length, but the number of learned parameters does not grow with n.
If you are interested in GPT-3 and want to read something beyond the GPT-3 paper itself, I think this is the best paper to read to get an understanding of this transformer architecture.
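For a concrete feel for the core operation, here is a minimal NumPy sketch of the scaled dot-product attention from the paper (toy dimensions, and the learned Q/K/V projection matrices are omitted for brevity):

```python
import numpy as np

def attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V, computed row-wise."""
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                 # (n, n): all position pairs
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # rows sum to 1
    return weights @ V                              # (n, d_v)

rng = np.random.default_rng(0)
n, d = 5, 8                                         # 5 positions, 8-dim vectors
X = rng.normal(size=(n, d))
out = attention(X, X, X)                            # self-attention
print(out.shape)                                    # (5, 8)
```

Note the (n, n) weight matrix: the computation is quadratic in sequence length, while the parameters (the projections producing Q, K, V in the real model) are fixed-size.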
1) Michael, C. J., Acklin, D., & Scheuerman, J. (2020). On interactive machine learning and the potential of cognitive feedback. ArXiv:2003.10365 [Cs]. http://arxiv.org/abs/2003.10365
2) Denton, E., Hanna, A., Amironesei, R., Smart, A., Nicole, H., & Scheuerman, M. K. (2020). Bringing the people back in: Contesting benchmark machine learning datasets. ArXiv:2007.07399 [Cs]. http://arxiv.org/abs/2007.07399
3) Jo, E. S., & Gebru, T. (2020). Lessons from archives: Strategies for collecting sociocultural data in machine learning. Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, 306–316. https://doi.org/10.1145/3351095.3372829
Also a great read related to IML tooling for audio recognition:
1) Ishibashi, T., Nakao, Y., & Sugano, Y. (2020). Investigating audio data visualization for interactive sound recognition. Proceedings of the 25th International Conference on Intelligent User Interfaces, 67–77. https://doi.org/10.1145/3377325.3377483
http://www.pnas.org/lookup/doi/10.1073/pnas.1915006117
You might think that it's possible to use machine learning to predict whether people will be successful using established socio-demographic, psychological, and educational metrics. It turns out that it's very hard, and simple regression models outperform the fanciest machine learning ideas for this problem.
The way this study was done is also interesting and paves the way for new kinds of collaborative scientific projects that take on big questions. It draws on communities like Kaggle, but applies the approach to scientific questions, not just pure prediction problems.
Fellow HNers seem to have liked a lot of ML papers, and this one doesn't break the trend. This is a great meta paper questioning the goal of the field itself, and proposing ways to formally evaluate intelligence in a computational sense. Chollet is even ambitious enough to propose a proof-of-concept benchmark! [2] I also like some out-of-the-box methods people tried to get closer to a solution, like this one combining cellular automata and ML [3]
[1] https://arxiv.org/abs/1911.01547 [2] https://github.com/fchollet/ARC [3] https://www.kaggle.com/arsenynerinovsky/cellular-automata-as...
A good incremental improvement in service level indicator measurements for large-scale cloud services.
Obligatory The Morning Paper post: https://blog.acolyer.org/2020/02/26/meaningful-availability/
In computing theory, when do you actually need coordination to get consistency? They partition the space into two kinds of algorithms, and show that only one kind needs coordination.
CACM, 9/2020. https://cacm.acm.org/magazines/2020/9/246941-keeping-calm/fu...
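The intuition can be sketched with the canonical monotone example, set union (my own illustration, not code from the paper): a grow-only set reaches the same state under every delivery order, so replicas agree without coordinating. A non-monotone question like "is x absent?" can flip its answer as more data arrives, and that is where coordination becomes necessary.

```python
import itertools

updates = [{"a"}, {"b"}, {"a", "c"}]

results = set()
for perm in itertools.permutations(updates):
    state = set()
    for u in perm:              # apply updates in this delivery order
        state |= u              # monotone: the set only ever grows
    results.add(frozenset(state))

print(results)                  # one outcome, regardless of delivery order
```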
Some pretty mind-blowing insights. For example: if you replace one layer's weights in a trained classification network with that layer's initialisation weights (or some intermediate checkpoint), many networks show relatively unaffected performance for certain layers, which is read as a form of generalisation since it amounts to parameter reduction. However, if you replace them with fresh random weights (even though the initialisation state is itself just another set of random weights), the loss is high! Some layers are more sensitive to this than others across different network architectures.
I recently summarised this to a friend who asked "what's the most important insight in deep learning?" - to which I said - "in a sufficiently high dimensional parameter space, there is always a direction in which you can move to reduce loss". I'm eager to hear other answers to that question here.
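The experimental protocol is easy to sketch on a toy network (my own illustration, not the paper's code; whether a net this small reproduces the paper's layer-robustness finding is not guaranteed): train, then re-evaluate the loss with the first layer rewound to its checkpointed initialisation versus replaced by fresh random weights.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = np.tanh(X @ rng.normal(size=(10, 1))) + 0.1 * rng.normal(size=(200, 1))

def loss(W1, W2):
    return float(np.mean((np.tanh(X @ W1) @ W2 - y) ** 2))

W1 = rng.normal(size=(10, 16)) * 0.1
W2 = rng.normal(size=(16, 1)) * 0.1
W1_init = W1.copy()                        # checkpoint the initialisation
loss_at_init = loss(W1, W2)

for _ in range(1000):                      # plain gradient descent
    H = np.tanh(X @ W1)
    err = (H @ W2 - y) / len(X)
    gW2 = H.T @ err
    gH = (err @ W2.T) * (1 - H ** 2)
    gW1 = X.T @ gH
    W1 -= 0.05 * gW1
    W2 -= 0.05 * gW2

print("trained:        ", loss(W1, W2))
print("rewound to init:", loss(W1_init, W2))   # layer 1 reset to its init
print("fresh random:   ", loss(rng.normal(size=(10, 16)) * 0.1, W2))
```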
2) Snowflake and its tiered storage, among other things http://pages.cs.wisc.edu/~yxy/cs839-s20/papers/snowflake.pdf
Gradualizing the Calculus of Inductive Constructions (https://hal.archives-ouvertes.fr/hal-02896776/)
I'm not sure if this is precisely the direction things should go in order to improve the utilisation of specification within software development, but it's a very important contribution. As yet my favourite development style has been with F-star, but F-star also leaves me a bit in the lurch when the automatic system isn't able to find the answer: too much hinting in the case of hard proofs.
Eventually there will be a system that lets you turn the crank up on specification late in the game, allows lots of the assertions to be discharged automatically, and then finally saddles you with the remaining proof obligations in a powerful proof assistant.
Chen, Y.W.; Yiu, C.B.; Wong, K.Y. Prediction of the SARS-CoV-2 (2019-nCoV) 3C-like protease (3CL (pro)) structure: Virtual screening reveals velpatasvir, ledipasvir, and other drug repurposing candidates. F1000Research 2020, 9, 129.
This paper (based on a machine learning-driven open source drug docking tool from Scripps Institute) from Feb/Mar formed the basis for the agriceutical venture I started for supporting pandemic management in Africa. We’re in late stage trialing talks with research institutes here in East Africa.
Thought the Naiad project is really cool!
https://cs.stanford.edu/~matei/courses/2015/6.S897/readings/...
An interesting (no pun intended) paper on what makes papers (or anything in general) interesting.
It’s fairly accessible to anyone who vaguely remembers their CS theory, and quite fun!
Automerge [2] implements a variant of this.
Which is actually great because it gives me something to read on subjects I'm not familiar with.
Direct Feedback Alignment Scales to Modern Deep Learning Tasks and Architectures [2] applies DFA, an approach to training neural nets without backprop, to modern architectures like the Transformer. It does surprisingly well and is a step in the right direction for biologically plausible neural nets as well as potentially significant efficiency gains.
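To make the DFA idea concrete, here's a toy NumPy sketch (my own illustration, not the paper's code): the output error reaches the hidden layer through a fixed random matrix B instead of through the transpose of the output weights, as backprop would require.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 20))
y = np.sin(X[:, :1])                      # toy regression target

W1 = rng.normal(size=(20, 32)) * 0.1      # hidden layer
W2 = rng.normal(size=(32, 1)) * 0.1       # output layer
B = rng.normal(size=(1, 32))              # fixed random feedback, never trained

mse0 = float(np.mean((np.tanh(X @ W1) @ W2 - y) ** 2))

lr = 0.02
for _ in range(500):
    H = np.tanh(X @ W1)                   # forward pass
    e = (H @ W2 - y) / len(X)             # output error
    W2 -= lr * (H.T @ e)                  # output layer: ordinary delta rule
    dH = (e @ B) * (1 - H ** 2)           # DFA: error via B, not via W2.T
    W1 -= lr * (X.T @ dH)

mse = float(np.mean((np.tanh(X @ W1) @ W2 - y) ** 2))
print(f"mse: {mse0:.3f} -> {mse:.3f}")
```

The biological-plausibility angle is that B requires no weight transport: the backward pathway never needs to mirror the forward weights.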
Hopfield Networks is All You Need [3] analyzes the Transformer architecture as the classical Hopfield Network. This one got a lot of buzz on HN so I won't talk about it too much, but it's part of a slew of other analyses of the Transformer that basically show how generalizable the attention mechanism is. It also sorta confirms many researchers' inkling that Transformers are likely just memorizing patterns in their training corpus.
Edit: Adding a few interesting older NLP papers that I came across this year.
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding [4]
Do Syntax Trees Help Pre-trained Transformers Extract Information? [5]
Learning to Compose Neural Networks for Question Answering [6]
Parsing with Compositional Vector Grammars [7]
[1] https://arxiv.org/abs/2006.11287
[2] https://arxiv.org/abs/2006.12878
[3] https://arxiv.org/abs/2008.02217
[4] https://arxiv.org/abs/1908.04577
[5] https://arxiv.org/abs/2008.09084
https://deepmind.com/research/publications/Mastering-Atari-G...
It felt like a baby step towards general intelligence.
https://www.nationalgeographic.com/science/2020/04/first-spi...
Note that it has been published on arXiv just yesterday; I helped review an earlier draft.
Erik Hoel in this paper offers an audacious hypothesis: our brain, during its evolution, developed dreams as a way to combat overfitting.
Since we're learning from limited samples of data in the real world, the chance of overfitting (I call it judgement) goes up. In ML we inject randomness and noise to avoid overfitting; Hoel's theory can explain why our dreams are so sparse & hallucinatory
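A minimal sketch of that noise-injection idea, using inverted dropout (my own illustration, not anything from the paper): during training, each unit is randomly zeroed, which is roughly the kind of corrupted experience the paper analogises to dreaming.

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(h, p=0.5, training=True):
    """Inverted dropout: zero each unit with prob. p, rescale survivors."""
    if not training:
        return h
    mask = rng.random(h.shape) >= p
    return h * mask / (1 - p)

h = np.ones((4, 8))
d = dropout(h, p=0.5)
print(d)            # roughly half the entries zeroed, the rest scaled to 2.0
```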
https://www.jprasurg.com/article/0007-1226(75)90127-7/pdf
Great read. Note, if you're not going to read it: you yourself should not eat 35 eggs per day; these patients had calorie requirements of a little under 7000.
"Equality of Opportunity in Supervised Learning" (https://arxiv.org/abs/1610.02413)
It explains the basic concepts of fairness in ML, with a very practical example from my domain that shows the trade-off between the fairness of an algorithm and overall performance (money). It really makes you see what may go wrong with bias in ML. It shows, in my opinion, why we will have to regulate ML, as corporations aren't really incentivized to deal with fairness. It also shows that there are different notions of fairness, so there will always be something that feels unfair, and doing something can always be interpreted as positive discrimination.
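The paper's central notion can be checked mechanically: "equal opportunity" asks that the true-positive rate be the same across protected groups. A minimal sketch with made-up data:

```python
import numpy as np

def tpr(y_true, y_pred, group, g):
    """True-positive rate for group g: P(pred=1 | true=1, group=g)."""
    mask = (group == g) & (y_true == 1)
    return float(np.mean(y_pred[mask]))

# Toy labels and predictions for two groups (entirely made up).
y_true = np.array([1, 1, 1, 1, 0, 0, 1, 1, 1, 1, 0, 0])
y_pred = np.array([1, 1, 0, 1, 0, 1, 1, 0, 0, 1, 0, 0])
group  = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])

for g in (0, 1):
    print(f"group {g}: TPR = {tpr(y_true, y_pred, group, g):.2f}")
# The gap (0.75 vs 0.50 here) is what the paper's post-processing step
# would close, e.g. by choosing per-group decision thresholds.
```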
Deep brain optogenetics without intracranial surgery
"Achieving temporally precise, noninvasive control over specific neural cell types in the deep brain would advance the study of nervous system function. Here we use the potent channelrhodopsin ChRmine to achieve transcranial photoactivation of defined neural circuits, including midbrain and brainstem structures, at unprecedented depths of up to 7 mm with millisecond precision. Using systemic viral delivery of ChRmine, we demonstrate behavioral modulation without surgery, enabling implant-free deep brain optogenetics."
https://groups.csail.mit.edu/genesis/papers/radul%202009.pdf
Even if a bit impractical in some regards, I think an operating system/cloud that you interact with like a database is something we should aspirationally strive for. We're spending too much time gluing things together and not enough time being productive. Databases are great at tracking and describing resources (much better than YAML), and stored procedures that are like Lambdas would be neat.
A killer paper presenting an algorithm capable of inductive learning. ("DreamCoder solves both classic inductive programming tasks and creative tasks such as drawing pictures and building scenes. It rediscovers the basics of modern functional programming, vector algebra and classical physics, including Newton's and Coulomb's laws.")
http://www.heathershrewsbury.com/dreu2010/wp-content/uploads...
I think it's going to be years before we understand this properly, but in 2020 we are beginning to see practical uses.
At the moment I think it's a toss-up: either it's going to be a curiosity that people read, have their minds exploded by, but can't do anything with, or else it has a good chance of being the most influential deep learning paper of the decade.
In this paper, they did that with genes. And the 2D space that was left wasn't meaningless at all: it accurately recreated a map of Europe.
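The reduction itself is just PCA, which can be sketched in a few lines via the SVD (random data here standing in for the individuals-by-markers genotype matrix):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 500))          # 100 individuals, 500 "markers"

Xc = X - X.mean(axis=0)                  # center each marker column
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
coords = Xc @ Vt[:2].T                   # each individual as a 2-D point

print(coords.shape)                      # (100, 2)
```

In the paper, those two coordinates (after a rotation) line up strikingly well with each individual's geographic origin.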
No one knows what attention is (https://link.springer.com/article/10.3758/s13414-019-01846-w)
Molecular repertoire of Deinococcus radiodurans after 1 year of exposure outside the International Space Station within the Tanpopo mission: https://microbiomejournal.biomedcentral.com/articles/10.1186...
I've been reading up on the object capability security model a lot recently, and was pointed to this paper... I was hooked. A really compelling security model almost from first principles.
https://www.jneurosci.org/content/39/2/307
Abstract run through a text optimizer:
We administered 100 mg MDMA or placebo to 20 male participants in a double-blind, placebo-controlled, crossover study.
Cooperation with trustworthy, but not untrustworthy, opponents was enhanced following MDMA but not placebo.
Specifically, MDMA enhanced recovery from, but not the impact of, breaches in cooperation.
During trial outcome, MDMA increased activation of four clusters incorporating the precentral and supramarginal gyri, superior temporal cortex, central operculum/posterior insula, and supplementary motor area.
MDMA increased cooperative behavior when playing trustworthy opponents.
Our findings highlight the context-specific nature of MDMA's effect on social decision-making.
While breaches of trustworthy behavior have a similar impact following administration of MDMA compared with placebo, MDMA facilitates a greater recovery from these breaches of trust.
https://pdfs.semanticscholar.org/c26b/4d3156b0c526d16c891ce7...
>"three of the four most cited papers in the journals deal with hypoxia [...] yet its routine clinical use is very limited."
'What if we treated AI as equals, like other human beings, not as tools or, worse, slaves to their creators?' That's the premise of this paper, which is a wonderful provocation. It's a really important consideration too, when you consider how many of our decisions we're asking machine sentience to make for us. If algorithmic bias were a human judge, they'd be thrown out of court (you'd hope).
https://www.researchgate.net/publication/342317256_A_systema...
-- because it was relatively straightforward to understand and convert to code. So it helped me understand backprop.
https://www.usenix.org/legacy/event/osdi10/tech/full_papers/...
Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion: https://arxiv.org/abs/2008.12228
aka SIREN: https://vsitzmann.github.io/siren/
"Erotic Modesty: (Ad)dressing Female Sexuality and Propriety in Open and Closed Drawers, USA, 1800–1930" https://onlinelibrary.wiley.com/doi/abs/10.1111/1468-0424.00...
When James C. Scott wrote about infrapolitics in his 1990 work "Domination and the Arts of Resistance: Hidden Transcripts" (https://www.jstor.org/stable/j.ctt1np6zz) and described it as a sort of political resistance that never declares itself and remains beneath what the dominant group can properly perceive until the power shift actually starts to happen, he probably didn't think of a case where the undeclared politics is so literally something not meant to be seen. The theme is very much a progression of how slowly women were able to establish even which parts of their bodies could or could not be sexualized, culminating in a sudden burst, or power shift, in the 1910s-1930s after centuries of aggregating individual choices and entirely unseen acts. This particular revolution managed to happen almost entirely outside of organization and public view, and while it's by no means over, the progress made in the last twenty years the paper covers really shows how much change the aggregation of individual acts of resistance, done without any open plans, can bring about. It also shows the limits of such movements, particularly when the dominant group has an active interest in preserving that status quo.
"How Qualified Immunity Fails" https://scholarship.law.nd.edu/cgi/viewcontent.cgi?article=4...
and "The Case Against Qualified Immunity" https://scholarship.law.nd.edu/cgi/viewcontent.cgi?article=4...
These two were both written by UCLA Law professor Joanna Schwartz over the course of about a year and a half from 2017-2018, and really got a lot of attention this year when a lot of people asked for the first time, "why does it seem impossible to actually hold abusive police to some degree of personal responsibility?" Having worked at a public defender's office and then on federal CJA cases (essentially federal defense work when there is more than one codefendant and the federal defenders would have a conflict of interest defending both), the abusive nature of policing was very much something I saw constantly for years, but it's difficult to quantify just how little potential consequence a police officer may actually face, because nobody had done the shoeleather work to collect the data, and police departments tend to have opacity written into their contracts. The data Schwartz collected demonstrates how many layers of shielding are negotiated into police contracts, and just how much indemnification (which is actually illegal in many jurisdictions but universally ignored) pushes any potential liability onto taxpayers directly, creating a situation where victims' taxes are just getting looped back into the settlements they receive. There are a lot of problems in the criminal justice system, and really any carceral system this country runs, and most of them are poorly documented on a systemic level and difficult to quantify. It's nice to see that someone put in the work to make the picture a little clearer, as practitioners tend to be too focused on their clients to do research like this, and this is a particularly unglamorous field of research.
MMR vaccine could protect against COVID-19
https://mbio.asm.org/content/11/6/e02628-20?_ga=2.139230451....
Constantinescu, Alexandra O., Jill X. O’Reilly, and Timothy EJ Behrens. "Organizing conceptual knowledge in humans with a gridlike code." Science 352.6292 (2016): 1464-1468.
Kriegeskorte, Nikolaus, and Katherine R. Storrs. "Grid cells for conceptual spaces?." Neuron 92.2 (2016): 280-284.
Klukas, Mirko, Marcus Lewis, and Ila Fiete. "Efficient and flexible representation of higher-dimensional cognitive variables with grid cells." PLOS Computational Biology 16.4 (2020): e1007796.
Moser, May-Britt, David C. Rowland, and Edvard I. Moser. "Place cells, grid cells, and memory." Cold Spring Harbor perspectives in biology 7.2 (2015): a021808.
Quiroga, Rodrigo Quian. "Concept cells: the building blocks of declarative memory functions." Nature Reviews Neuroscience 13.8 (2012): 587-597.
Stachenfeld, Kimberly L., Matthew M. Botvinick, and Samuel J. Gershman. "The hippocampus as a predictive map." Nature neuroscience 20.11 (2017): 1643.
Buzsáki, György, and David Tingley. "Space and time: The hippocampus as a sequence generator." Trends in cognitive sciences 22.10 (2018): 853-869.
Umbach, Gray, et al. "Time cells in the human hippocampus and entorhinal cortex support episodic memory." bioRxiv (2020).
Eichenbaum, Howard. "On the integration of space, time, and memory." Neuron 95.5 (2017): 1007-1018.
Schiller, Daniela, et al. "Memory and space: towards an understanding of the cognitive map." Journal of Neuroscience 35.41 (2015): 13904-13911.
Rolls, Edmund T., and Alessandro Treves. "The neuronal encoding of information in the brain." Progress in neurobiology 95.3 (2011): 448-490.
Fischer, Lukas F., et al. "Representation of visual landmarks in retrosplenial cortex." Elife 9 (2020): e51458.
Hebart, Martin, et al. "Revealing the multidimensional mental representations of natural objects underlying human similarity judgments." (2020).
Ezzyat, Youssef, and Lila Davachi. "Similarity breeds proximity: pattern similarity within and across contexts is related to later mnemonic judgments of temporal proximity." Neuron 81.5 (2014): 1179-1189.
Seger, Carol A., and Earl K. Miller. "Category learning in the brain." Annual review of neuroscience 33 (2010): 203-219.
Neurolinguistics:
Marcus, Gary F. "Evolution, memory, and the nature of syntactic representation." Birdsong, speech, and language: Exploring the evolution of mind and brain 27 (2013).
Dehaene, Stanislas, et al. "The neural representation of sequences: from transition probabilities to algebraic patterns and linguistic trees." Neuron 88.1 (2015): 2-19.
Fujita, Koji. "On the parallel evolution of syntax and lexicon: A Merge-only view." Journal of Neurolinguistics 43 (2017): 178-192.