Fine-tuned BERT failed completed at predicting if a headline would get upvoted on HN or what the comments/votes ratios would be. Those are even noisier than the recommender so it comes as no surprise to me. (For both of those my best model is still a bag-of-words model.)