Neural Temporality Adaptation for Document Classification:
Diachronic Word Embeddings and Domain Adaptation Models
Abstract
Language usage can change over time, but document classifiers are usually trained and tested on corpora spanning multiple years without accounting for temporal variation. This paper describes two complementary ways to adapt classifiers to shifts across time. First, we show that diachronic word embeddings, which were originally developed to study language change, can also improve document classification, and we present a simple method for constructing this type of embedding. Second, we propose a time-driven neural classification model inspired by methods for domain adaptation. Experiments on six corpora show how these methods can make classifiers more robust over time.