The Data Hiding Inside Ebooks

I recently met with one of the founders of a startup pushing the boundaries of what’s possible in ebooks. We discussed what impact more technology and better insight into reader behavior would have on publishing. Could books be more like web sites or apps in that regard? Would that be a better experience for readers, writers, and publishers?

A powerful and sometimes scary thing about web sites is the data they collect about their users. As a site owner, it helps to know which of my pages attracted the most visitors, where those visitors came from, and how long they stayed. Given this data, I can add more content similar to what’s worked before, avoid what hasn’t, and build an audience by promoting my site on other sites that send me quality traffic. Improving my site is rarely as simple as site A sends me more people than site B so I’ll pander to that audience but regardless, some insight into my audience is better than none.

Now apply these ideas to a blog. If I know what posts attract readers and I write more on similar topics, that data is now helping me as a writer in addition to a web site owner. I get to understand my readers in a whole new way. Sure, this type of feedback isn’t as personal and human as a comment or a conversation but it’s feedback nonetheless. The real question is: why isn’t data like this available for writers beyond their blogs? Most, if not all, e-readers have Internet access. The popular book formats (Kindle’s .mobi, Apple’s .ibooks, and everyone else’s ePub) are built on the core language of the web (HTML). In essence, what’s possible on the web today should be possible in e-readers today.

A valuable piece of web site data is the exit rate for its pages. The exit rate is what percent of people go to a different site from a given page. A page with a high exit rate is one where lots of people drop off. Imagine if ebooks understood exit rates. It could help a novelist see how far readers got in a book. The chapter, section, or page with the highest exit rate would be a great candidate for revision. The writer could see where he or she loses readers with zero effort; they would just read and stop reading whenever they felt like it. If a writer had this data and was willing to act on it, he or she could even update the ebook with revisions and measure success with new readers.

Even basic data could help answer valuable questions about any given ebook for the writer and publisher:

About the free sample:

  • Of the people who downloaded the free sample, how many finished?
    Suggests how successful the sample is.
  • Where did most people stop reading the free sample?
    Suggests if the sample needs work or if a new sample is needed.
  • At which point in the free sample did they buy?
    Suggests how successful the sample is.

About the book as a whole:

  • How many people started? Finished?
    Suggests how successful the story is overall. Can show the author improving when compared with previous work.
  • How many bookmarks, notes, annotations, and shares did readers make?
    Suggests how much impact the story had on the reader.
  • How often did people read?
    Suggests how engaged the readers are.
  • How long did it take to finish?
    Suggests how engaged the readers are.
  • How long was a typical reading session?
    Suggests how engaged the readers are.

About the reading experience:

  • What did they read before this?
    Suggests opportunities for marketing and cross-promotion.
  • What did they read after?
    Suggests opportunities for marketing and cross-promotion.

Some of this data would be pushing at the privacy boundaries of readers so I would make it all aggregated, anonymized, and requiring readers to opt-in.

As a reader, would you be willing to opt-in and passively share this data about your interactions with a book?

As a writer, editor, or publisher, would you use this data as part of your process of editing and gathering feedback?

4 thoughts on “The Data Hiding Inside Ebooks”

  1. While I do agree that it would help authors figure out what is liked or not liked in their books and I do agree that the first 50-100 pages are some of the most important in any book, I think that authors would benefit more by putting a link to an opinion page in the ebooks. Also, not all ereaders have internet.

    It’s a good idea but I think the data would be flawed by too many variables. There are people who flip to the last page before they even start a book. There are people who read slowly and others who read quickly. I do like the fact that it will be optional. I was thinking all the way through reading your email that it could be misused in many different ways and that I would probably not want to use it simply because I wouldn’t want my reading data being sent off to be seen by people I don’t know.

    In the long run I think that it would cause more of a uniformed writing style that caters to the masses rather than the people who like certain books simply because they are different. Have you heard about the sites that pretty much write screenplays for you? They provide outlines for what a poplar movie should be rather than allowing/forcing a writer to be creative. I call them cookie cutter movies. They suck lol

    1. Great points! I think clever statistics and data crunching could compensate for readers doing less predictable things like flipping to the back of the book. It does brings up the interesting point: can a reading experience be quantified in any useful way?

      I don’t know if in the long run making this data available would create a more uniform writing style. There will always be hacks and uninspired things published. I would expect a good author to follow his instincts and fine-tune with this data without completely relying on it.

  2. I like the idea a lot, and I like that it’s optional. My only topic of inquiry would be the underlying assumption that what’s popular is equivalent to what is good writing. I’d argue that a writer doesn’t improve their writing style by looking at exit rates and stuff, but only makes it more sellable. (I think Myst commented on this idea too).

    To be fair, however, the kind of author who would be this deep in techie, e-Reader type dealings would already be interested in keeping up with the bandwagon of modern readers and tapping into the potential there; this idea serves a very particular niche (and there are plenty of other authors who aren’t in this internet/e-Reader-based system). Given that, all the ideas in the post are interesting and, even used to promote marketability, serves to promote the new e-Reader culture and illuminate for authors components of their work that would otherwise be inaccessible.

    1. I doubt we’ll quantify good writing any faster than consciousness or subjective human experience. But making your writing more sellable can be a wonderful thing, especially if you’re self-published. You said it yourself, it’s all about illuminating what would otherwise be inaccessible.

Leave a Reply to Myst Cancel reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.