The Rise of Topic-Based Video in Task-Based Documentation: Is It Time For DITA and Video?
By Sean Healey, Owner, Wild Basin Media
The adoption rate of the Darwin Information Typing Architecture (DITA) has been truly remarkable. While its growth stems from and is most apparent across those organizations that publish technical documentation, DITA is certainly not limited to technical publication departments. Because of its specialization feature, DITA is now viewed by a variety of businesses as a stable, low-cost solution to all sorts of documentation from marketing to financial reports to training material and so on. By some estimates, up to 80 percent of new XML publishing implementations will be DITA-based by the middle of 2009.
Also on the rise is the use of video deployed to the Web, which should be of no surprise to even the occasional Internet user. Anecdotal evidence of this ascendancy has to include YouTube, the fastest growing site in Internet history. On average, 100 million videos are streamed from YouTube every day. 65,000 new video clips are uploaded every day. 13 million viewers visit YouTube every month. One has to look no further than the home page of the United States White House website to see how mainstream Web video has become and how much value has been put in its ability to communicate a message to a broad audience.
So why haven’t we seen more video integrations within structured documentation implementations such as DITA? At face value, video would seem to be a perfect fit with task-based procedural documentation because of its ability to show how to do a task or series of tasks.
The lack of such implementations is due in part because video is largely a black box. The information within it is rarely described with the necessary level of precision. Most viewers will not watch 10 minutes of video to find 30 seconds of relevant material. Many how-to video companies provide users with search capabilities; however, the level of granularity for these videos is usually too high to be applied to technical, task-based documentation where the ability to quickly find and understand discrete steps is key. Video chunked into relatively large pieces by typical methods is as unwieldy as coarsely-chunked unstructured documentation. Content reuse and searchability become at best difficult or at worse impossible.
What if steps within a DITA-based topic had video links to show how to complete a discrete step or procedure? We have all used search engines that allow us to enter criteria that return topics where the text is found. What if we entered search terms that take us to a particular video segment that demonstrates how to complete a specific task? What if we could browse clickable thumbnails to find relevant video segments? What if every time a video segment displays, its associated text-based topic displays as well?
MPEG-7 is an important component in the advancement of video-integrated documentation, and provides of bridge to these solutions. MPEG-7 is a multimedia (video, audio, voice, images, graphs, 3-D models, etc.) content description standard. Its tagged data can be passed into, or accessed by, devices or computer code. This description standard is not aimed at any one application in particular; rather, it supports a range of applications. MPEG-7 uses XML to store metadata that references multimedia instances. It provides fast and efficient searching, filtering, and content identification when coupled with XSLT or XQuery. MPEG-7 is not to be confused with other standards developed by the Moving Picture Experts Group (MPEG) such as MPEG-2 or MPEG-4, as these deal with the actual encoding of moving pictures. MPEG-7 makes it possible to separate the information about video (i.e., metadata) from the video itself. This separation of concerns provides the key to the multimedia search conundrum, using a standard XML toolset and workflow.
Elements of the MPEG-7 schema can be attached to video time-code segments in order to tag particular events within video such as particular tasks within a topic. These video segments are then retrieved and viewed through the corresponding metadata. Video components described by MPEG-7 can be integrated into online documentation and made accessible to users in a similar way as are, for example, text-based components in DITA. From an author’s perspective, adding video to a documentation set is as easy as inserting a resource id into the XML source—just like one would for an image. Like their text-based counterparts, video components may be reused across topics and documentation sets. MPEG-7 offers a standardized approach to the management of video metadata.
For the greatest user experience possible, video production must adhere to quality standards and best practices. For example, even though video segments may be remixed across multiple documents, captured by multiple video teams over time, and processed by more than one editor, for users video segments should all appear as if they have originated at the same time and in the same place (see consistency). Technical publications departments deal with disparate writing styles among their writing staff by using a style guide. Similarly, the medium of technical video requires its own style and best practices guides to govern lighting, shot angles, sound quality, video formats, common workflow processes, etc. Like text-based content and images, video segments are subject to revisions and updates. Fortunately, MPEG-7 also provides a systematic way of dealing with video revisions, too.
To be sure, there are special considerations when using video in technical documentation, but imagine the benefits of having a methodical way of viewing discrete steps within, for example, an airplane engine maintenance manual. Mechanics could search across MPEG-7 repositories to find and view appropriate video segments and read the associated DITA-based documentation adjacent it for further reference. Because many parts and procedures are identical across plane models, standard reuse and effectivity principles can be employed for video in the same way as for text-based content. Just as task-based documentation is modularized so that new documents can be created from pieces of existing text, video segments, each assigned a unique id, can be reordered into new configurations. In short, the same themes (e.g., reuse, modularization, metadata employed for increased searchability, reduction of information redundancy) that run through the design of structured documentation can be applied to video with the help of MPEG-7 and associated technologies.
To this point, I’ve only mentioned camera-generated video as benefiting from an MPEG-7 implementations when applied to structured documentation. However, software-generated video (i.e., screen capture software such as Camtasia, Jing, Captivate, and IShowU) can be segmented, managed, and deployed in the same way. For example, it is possible to re-purpose your existing walk-through video of your latest software product, so that video segments can be searched and viewed. Because each segment is allocated a unique id and associated with metadata, segments can be reordered and reused where appropriate. This type of video is not equipment or software intensive. In fact, you may already have almost everything you need already.
With new video standards and practices, it is now possible to merge video seamlessly into structured documentation for the Web with greater control over its management, search and presentation. The convergence of greater bandwidth speeds, improvements in production tools, and cost-cuts in video production workflows make video-integrated documentation a viable enhancement to your structured documentation – technical or otherwise. It is conceivable that in the near future, video will be an integral part of structured documentation, just as it has in the Web at large.
The Content Wrangler























There are some issues with video in task-based doc.
For one, the cost is steep, at about one day per minute of high-quality finished video. Especially a short video is to convey information accurately, it’s important to pay careful attention to the things in the background or foreground—such as the default items in a list, or the values in the sample data. Since time between video projects is usually months, we are at best perpetual intermediates, re-learning the same feature set over and over, and never reaching the efficiency of a frequent user.
For two, video takes up a lot of space. I love using TechSmith Camtasia’s zoom-and-pan feature, and I love the control I get over the interaction and captions that a demo movie models in Adobe Captivate (though I should say the default captions are rarely right), but I wish it were easy to get both in one place. The important action can almost always be conveyed in a small window, but since interfaces are often larger, it takes a larger stage to show it all. (This is why Camtasia’s zoom-and-pan feature is such a delight.) TechSmith Jing is great, but allows no editing at all. The most frustrating thing is that neither Camtasia nor Captivate are intended for documentation content creators—thought that’s how Captivate started out, when it was still named RoboDemo.
Thanks for sharing your thoughts, Jerome. I don’t think video production has to be as difficult, time-consuming, or space hogging, etc. as you suggest, but I know it can be.
In my view, it will be interesting to see what happens as text-based documentation is replaced (and in many instances it should be) or accompanied by video documentation (in situations where this additional content provides value). As always, just because a project like this takes effort, storage space, or new software, skills, etc. is no reason it shouldn’t be pursued. I believe video documentation is the emerging field of technical communication with huge growth potential. Only time will tell if I’m correct.
What’s really exciting is that people are doing it today. In fact, here’s a link to a set of task-based online help files that has been beefed up with video (and encoded in DITA!) http://screencast.com/t/AZn6ijjbP (no sound yet, just video).
A few case studies will be available soon. And, there are some vendors working on top secret video and DITA projects right now. I can’t wait to see what they come up with.
Scott Abel
The Content Wrangler
If you’re not yet a member of The Content Wrangler Community, join us today! http://thecontentwrangler.ning.com
An excellent article. Searchability is the one obstacle I have to overcome in order to have viedo screencasts replacing user guides.
is the fact that video is the best media to deliver the notion that our GUI is superb and is very easy to be learned.
Video is faster to be read/viewed than textual instruction set. It’s also faster to create. More importanty (even mre important than my time
As of today, I have a 45-90 seconds video for each task, with the metadata written (as text) on the website where these videos are available. There’s so much to be improved here, and reading this article is an excellent starting point.
Really thought-provoking article, Sean. We’re going to be hearing about this more in the future, I’m sure. I’d love to see some real-world examples of DITA/video in action, if any exist. (The link Scott posted didn’t work, unfortunately.)
Thank you for your words of encouragement and interest in video mashups with DITA and other XML authoring formats.
For me, video integration is the next logical step in the evolution of DITA ;->. It’s not just that video is cool (well, video is cool, but coolness is hard to measure in ROI metrics terms). Relevant visual media coupled with text-based content has been shown to improve learning up to 87% (Dr. Ruth Clark and Dr. Richard Mayer, UC Santa Barbara). Of the visual media, video is high on the food chain. Research at University of Pittsburgh compared the effects of video with printed text, still pictures with text, and text alone. Video-with-text presentations produced results that were significantly superior to still-picture-with-text presentations (Dr. Khalid Al-Seghayer).
Video is not just a passing trend or window dressing. Done well, it makes the user experience better, and better search mechanisms through segmentation is a way of respecting our users’ time – our time – as we are all information consumers. The side effect of being better educators, is that we enhance our company’s bottom line. Is this connection tenuous? Show me a company with crappy help and I’ll show you a company that is hemorrhaging money by overtaxing its support team, handicapping its marketing effort, alienating users unwilling to wade through a swap of data, and may not fully understand the information re/evolution we live in. Information democratization means choices: if we don’t like the candidate, we move on.
The perception that video is costly to produce is a very real obstacle. It really depends on the scope of the project. There are three points that I’d like to make with regard to cost:
1. Reuse: Segments are IDed. With segment IDs, we enter into an object-oriented framework, where we can surgically update specific video segments. This reduces production costs over time, especially in an CMS construct.
2. What if you could put together a video mashup using open tools and standards “for free”. If Scott is amenable to such a series, I’ll be putting together a set of tutorials that will help tech pubs shops assemble the major building blocks of such a system. Stay tuned.
3. What is the cost of not taking a serious look at video as one more tool in your company’s toolbox? While I am dubious of some of the user-generated video found on the Web at large with regard to its educational value, YouTube does give us a glimpse through the glass darkly of the potential/inevitable magnitude of video’s role in online documentation and training.
Last week’s DocTrain West Conference was the “debut” of the video-integrated documentation demo. The positive feedback at the conference and into this week has far surpassed my expectations. Yesterday, a friend asked me how the concept of video/DITA mashups was received at the DocTrain. I told him that I feel a bit like Quint when he sees the shark for the first time and says, “I think we need a bigger boat.”
Mark,
My mistake…. I must have passed Scott a dead link. Try these:
Camera-generated video
http://screencast.com/t/z0OyhgXn6cV
Software-generated video
http://screencast.com/t/gQpxl6Fiz
I’ve had requests to post my slides from the DocTrain West Conference. Feel free to contact me if you would like more elaboration. I’ve also included a URL to an informational sheet re: video-integrated documentation. Some may be interested in the general DITA resources on the DITA & XML Community of the Rockies blog.
Download Slides
Download Informational Sheet
DITA & XML Community of the Rockies