Distinguishers in Morphology

A few years ago, I was introduced by Greg Stump to the notion of distinguishers in morphological description. The analysis of inflected forms in terms of theme + distinguisher is a very helpful concept and one that is made use extensively in my ongoing work on New Testament Greek morphology.

read more...

Atom Editor 1.1 Fixes Polytonic Greek Bug

Release 1.1 of GitHub’s Atom Editor fixes a problem I had with using it for polytonic Greek.

read more...

Renaming Non-Indicative Tense-Forms

I think it’s confusing that we name the non-indicative tense-forms with the same terms as indicative tense-forms. For example “present indicative” and “present infinitive”. The word “present” doesn’t mean the same thing in both cases.

read more...

An Experimental REST API to MorphGNT

Back in July, I thought I’d prototype a REST API for MorphGNT with resources for books, paragraphs, sentences, verses and words.

read more...

The Core Vocabulary of New Testament Greek

In a 2008 paper, Wilfred Major constructs what he calls the 50% and 80% vocab lists for Classical Greek. That is, the lemmata that account for 50% and 80% respectively of tokens in the Classical Greek corpus. In this post I provide the code for the equivalent for the Greek New Testament and talk about some of the results.

read more...

Mean Dependency Depth

With dependency paths calculated for the Greek New Testament, we can use mean dependency depth as a proxy for syntactic complexity.

read more...

Dependency Paths

For numerous corpus linguistics applications, it’s useful to have a word-level indication of syntax. A presentation by Vanessa and Robert Gorman gave me the idea of using dependency paths for this purpose so I’ve now calculated them for the GNT based on the GBI syntax trees.

read more...

Mean Log Frequency of Lexemes

One component of many readability measures on texts is the mean log word frequency. Here I do a basic calculation across chapters in the Greek New Testament (with code provided).

read more...

Updated Vocabulary Coverage Statistics

In various mailing list posts, blog posts and talks, I’ve shown vocabulary coverage statistics. It’s time to update the code to use more recent data and republish the results here.

read more...

Blogging Every Day Between Now and SBL Annual Meeting

It’s exactly four weeks until I’m presenting at the SBL Annual Meeting in Atlanta. As I have a long backlog of posts I’ve wanted to do for a while, I thought I might try to blog every day between now and my talk on November 22nd.

read more...