And the final bit of coding today. I started working on the XPath-based splitter. This is what is going to break apart a chapter into paragraphs, lines, and tokens/words.
That way, an echo plugin can search for `//token[length() > 3]` to get all tokens over three characters long.
Or `//para//token[index() = 1]` to get the first word of every paragraph. I want to use that to make sure no two paragraphs start with the same word.
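
To make the idea concrete, here is a rough Python sketch of the chapter/paragraph/line/token tree and the two checks above. It is not the actual splitter: the function names, the blank-line paragraph rule, and the token regex are all assumptions made for illustration. The `length()` and `index()` predicates are stand-ins written as plain Python filters, since those names are not standard XPath 1.0 functions (the closest built-ins are `string-length()` and `position()`).

```python
import re
import xml.etree.ElementTree as ET


def split_chapter(text):
    """Build a <chapter> tree of <para>, <line>, and <token> elements."""
    chapter = ET.Element("chapter")
    # Assumption: blank lines separate paragraphs.
    for block in re.split(r"\n\s*\n", text.strip()):
        para = ET.SubElement(chapter, "para")
        for raw_line in block.splitlines():
            line = ET.SubElement(para, "line")
            # Assumption: a token is a run of word characters or apostrophes.
            for word in re.findall(r"[\w']+", raw_line):
                ET.SubElement(line, "token").text = word
    return chapter


def long_tokens(chapter, min_length=3):
    """Rough equivalent of //token[length() > 3]."""
    return [t.text for t in chapter.iter("token") if len(t.text) > min_length]


def first_words(chapter):
    """Rough equivalent of //para//token[index() = 1]."""
    firsts = []
    for para in chapter.iter("para"):
        # iter() walks in document order, so the first hit is the first token.
        first = next(para.iter("token"), None)
        if first is not None:
            firsts.append(first.text)
    return firsts


def repeated_openers(chapter):
    """Return opening words already used by an earlier paragraph."""
    seen, repeats = set(), []
    for word in first_words(chapter):
        key = word.lower()
        if key in seen:
            repeats.append(word)
        seen.add(key)
    return repeats
```

Calling `repeated_openers(split_chapter(text))` on a chapter's text would list every paragraph opener that repeats an earlier one, which is the check an echo plugin could run.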
I figure I'm 2-4 days from seeing whether this POC works.