NLP for Historical Texts
Natural language processing for historical texts, computational linguistics and the digital humanities
View ArticleWorkshop on Computational Historical Linguistics at NoDaLiDa 2013
It’s been a couple of weeks now that I attended the NoDaLiDa 2013 workshop on Computational Historical Linguistics, where I gave an invited talk. The workshop—and Oslo in general—was a very pleasant...
View ArticletranScriptorium
This is just a brief project note: tranScriptorium is an EU-funded (FP 7 STReP) project, which aims to “develop innovative, efficient and cost-effective solutions for the indexing, search and full...
View ArticleVARD 2.5 Released
Another short project note: Last week, Alistair Baron released a new version (2.5) of VARD. VARD is one of the most popular programs for normalizing or modernizing historical texts prior to linguistic...
View ArticleTimeo Danaos… Or: Why I’m Wary of Google Docs
During the last couple of weeks I had to use Google Docs quite heavily in order to work on a large, collaborative document. Without any doubt, Google Docs is a convenient platform for collaborative...
View ArticleYet Another Publication List?
Cerstin Mahlow recently had a very good blog post on storing your publication list online with services such as ResearchGate, Mendeley, or Academia.edu. Cerstin expressed a preference for CiteULike,...
View ArticleDigital Humanities Defined
As I’m writing this, I’m on my way back from the conference “(Digital) Humanities Revisited – Challenges and Opportunities in the Digital Age,” which was organized by the Volkswagen Foundation and...
View ArticleA Review of Natural Language Processing for Historical Texts
LINGUIST List issue 25.292 features a review of my book Natural Language Processing for Historical Texts by Bev Thurber. I was very happy to read that she thinks that the book is “a good overview of...
View ArticleA Small Exercise in Computer Archeology
My first computer was an Atari 1040STF, which I used until about 1995, when I replaced it with an HP 715/50 workstation. I put the Atari away in the attic, where it remained for the last 18 years....
View ArticleBooks vs. Blogs
I’m not sure whether this is a contribution to the #wbhyp blog parade of de.hypotheses (see Wissenschaftsbloggen: zurück in die Zukunft – ein Aufruf zur Blogparade), but it does relate to some of the...
View ArticleAnother Enthusiastic Review of Natural Language Processing for Historical Texts
I’d like to (belatedly) point you to another review of my book Natural Language Processing for Historical Texts. The review by Laurent Romary appeared in Computational Linguistics 40:1, 231–232, and I...
View ArticleWriting For Hypotheses in Org-mode
A fact that should perhaps be emphasized when promulgating academic blogging is: it’s not for everyone. If you can regularly write a sparkling essay in half an hour, say, during your commute or your...
View ArticleA Live Publication List For My Web Site
The main problem with Twitter is that everything you post there is publicly visible. Oh, wait—are you saying that’s intentional? Well, then. Anne Baillot recently mentioned on Twitter she’s numbering...
View ArticleMendeley Revisited
A while ago, in a post titled Yet Another Publication List?, I ranted about the proliferation of online reference managers and speculated about their business models. A comment by William Gunn, Head of...
View ArticleThe Good, the Bad, and the Ugly
It’s already some time now since I’m back from the digital humanities conference double pack, which involved about 3,000 km of rail travel: first DHd 2015 in Graz, then the DH Summit in Berlin. In...
View ArticleDocument Engineering and Digital Humanities
Documents of all types have always played a central role in the humanities, both of an object of inquiry and for documenting research results. The digital humanities are thus also concerned with...
View ArticleBooks, Blogs, and Dialectics
Over the Easter holidays I read the little book Tschichold – na und? (‘Tschichold—so what?’) by Gerd Fleischmann (Wallstein, 2013). According to the introduction, it’s a collection of 24 “miniatures”...
View ArticleDH-MAX
In his recent blog post Die Digital Humanities brauchen ein Ziel: DH-MAX, Wolfgang Schmale wondered whether the digital humanities need a goal and suggested one potential goal, which he calls “DH-MAX.”...
View ArticlePlato and the Dormouse
During my summer vacation I read John Markoff’s book “What the Dormouse Said.” 1 I assume it was the publisher that insisted on adding the subtitle “How the Sixties Counterculture Shaped the Personal...
View ArticleRules
For a change, this post is actually related to NLP. It’s about rules, but not the linguistic kind, but those that govern our research communities. Not all proceedings are proceedings For ACL 2017, the...
View Article
More Pages to Explore .....