Home » Blog » Currently Reading:

Converting Poorly Structured Legacy Content Creates Difficult Challenges When Moving To DITA

May 18, 2007 Blog 2 Comments

Content management and information design expert JoAnn Hackos discusses moving legacy documentation to the Darwin Information Typing Architecture (DITA) in this interview with Data Conversion Laboratory. Hackos defines legacy content, why and when converting content to DITA is desirable, how to decide what to convert (and what not to), and why writing content and designing new structures are a critical success factor when moving to DITA.

Hackos says: “Do not accept the old structures for the future. The new structure not only should be DITA; it’s going to be more structured and more effective. In the process you may find information that can move to DITA and information that could be dropped. You could devise a fairly sophisticated conversion script to do some of the work.”

“You might have content that is already well structured in the original,” says Hackos. “Conversion to DITA is going to give you a lot of value. Then you have some content that is so badly done that you don’t want to use it. This information may have to be completely rewritten. You might even need to start from scratch.”

“You will likely have a considerable middle ground-that’s where most people are. You have valuable information with good content. You want to move to DITA but in the process of getting there you want to make some intelligent decisions about using that content in the future. Put all your decisions about your content in at least three buckets: what to leave back as legacy, what to convert, and what to rewrite. One caution, if you convert information that is badly structured, it becomes even harder to fix later.”

Similar Posts:

Print Friendly
Tags:,

Currently there are "2 comments" on this Article:

  1. Marcus Carr says:

    Hackos says: “Do not accept the old structures for the future. The new structure not only should be DITA; it’s going to be more structured and more effective.”

    I say: What do you mean by “old structures”? My existing XML? My “old structures” were designed to precisely describe the dataset, so how is a move to a generic schema like DITA going to result in something more structured?

    Hackos says: “You might have content that is already well structured in the original,” says Hackos. “Conversion to DITA is going to give you a lot of value.

    I say: No it’s not.

    Hackos says: Then you have some content that is so badly done that you don’t want to use it. This information may have to be completely rewritten. You might even need to start from scratch.”

    I say: In over fifteen years of doing conversions spanning numerous industries, I’ve never met a client that valued the structure of their documents more highly than the content. It’s usually possible to get them to make concessions where you can demonstrate that the data is not consistently organized, but telling them to rewrite it to fit with the structure that you’ve adopted is simply unrealistic.

    Documentation problems are hard. Despite hype to the contrary, they don’t just dry up and blow away when they see DITA coming.

  2. ScottAbel says:

    Marcus:

    Thanks for your comments. I hear what you’re saying and think that this topic is certainly going to be an important one as the industry increasingly adopts structured XML content management methodologies.

    Scott Abel

    The Content Wrangler

Comment on this Article:

Subscribe to the Newsletter

Get The Content Wrangler Newsletter delivered straight to your home or work Inbox. It's full of content goodness.

Sponsors

Scriptorium
Content Rules
Dozuki
iFixit.com
oManual
Fractal Enterprise
LavaCon
Adobe FrameMaker
Gnostyx
STC
WordPress Consulting
MindTouch Techcomm
MindTouch 2
Grammar Girl
Acrolinx 1
SDL Live Content
JFM Concepts VDP Web
Smart TV San Francisco
Oxygen
MindTouch 1
Southern Polytechnic
Earley Associates Workshops
Content Rules 2
Text Wrangler
TC World Magazine

Recent Comments

  • DataComm Plus: Communication is challenge within itself. I believe that mos...
  • Barbara Saunders: I think the problem of writers who "think they are artists" ...
  • Mark Baker: @Marcia -- What is conventional wisdom for except to questio...
  • Mark Baker: @Joe -- Many of the things you might want to link on already...
  • Marcia Riefer Johnston: P.P.S. I love your title so much, in fact, that I've just a...

Readers

Subscribe by or


Archives