Sebastian Heath is a user on octodon.social. You can follow them or interact with them if you have an account anywhere in the fediverse. If you don't, you can sign up here.
Sebastian Heath @sebhth

On an experimental basis, I'm working with a visiting student to distribute born-digital html-based articles as "stand alone" files that include all assets and are easily downloaded. See:

isawnyu.github.io/isaw-papers-

They key is that images are included as base64-encoded data urls. The strategic goal is to explore as an open and easily parseable equiv to PDF. More work to do, but I'm pleased with her progress.

· Web · 2 · 3

@sebhth It sounds interesting, but why choose it over EPUB? I'm guessing it's because Web browers are already set up for HTML, but to compete with PDF a format optimally needs to support pagination for citation purposes. I looked at some of the examples via the link and they seemed unpaginated.

@GardenOfForkingPaths

One quick answer:

Here are the ePub+Mobi versions of same content:

isawnyu.github.io/isaw-papers-

Longer:
The "standalone" files have RDFa + other metadata. They remain as parseable as any html file. More amenable to various forms of computational analysis.

Longer, longer:
In general, ISAW Papers (the journal) avoid pagination. Each paragraph is directly citable as a more fine grained solution. E.,g.,

dlib.nyu.edu/awdl/isaw/isaw-pa

Many thanks for responding/asking!

@sebhth @GardenOfForkingPaths

Yeah, pagination is problematic in formats that reflow in viewers. I agree that we need to move toward citation mechanisms that correspond to semantic structures inherent in the work, rather than arbitrary chunking driven by a particular display medium, and can be encoded across all formats used.