SlideShare a Scribd company logo
Implementing a Customer-driven
Transition to DITA Content
A Step-by-Step Journey to Success
www.thecontentera.com 111/2/2017
President –
Director Tech Comm –
SVP Global Operations –
Founder/CEO –
Founder/CEO - Tom Aldous
Email: tom.aldous@thecontentera.com
Blog: www.tmaldous.com
YouTube: http://bit.ly/TheContentEra_YouTube
www.thecontentera.com 2 11/2/2017
- 30 Years in Technical Communication
- XML Expert
- Certified FrameMaker Expert
- Structured Authoring Wizard
- DITA Specialization Expert
- Content Strategist
- Industry Influencer
About Us
 We love solving problems in real-world conditions.
 We help you identify the tools and strategies that work best for your
content. Then, we train you to use them effectively.
 We can help you with structured content creation and management.
 Our knowledge of content tools, markup languages,
scripting/automation, and best practices reduces cycle time and
increases efficiency.
 We know your existing content is valuable: many of our projects
involve conversions to structured content and between XML
standards.
www.thecontentera.com 3 11/2/2017
Today’s goal and how we can help
 Demonstrate one client’s unique issues and showcase the TCE
solution.
 Every company has a unique set of requirements and issues. Cookie-
cutter solutions cannot address unique content.
 Existing content rarely translates directly to the DITA standard -
creative solutions built with XSLT, read-write rules and scripts
prepare your content for the transition.
 If you do not need to convert content and your solution is simple,
we can train you to modify templates to extend out-of-box
solutions.
www.thecontentera.com 4 11/2/2017
Challenges
 Categorize topics as concept, reference, or
task upon conversion.
 Minimize content rework before conversion
(such as untangling conceptual information
from procedures).
 Retain some formatting items, such
as table alignment.
 Retain some unstructured FrameMaker features
in early phases to support a phased transition.
 Address legacy content that inherently includes overrides,
formatting tags, cross-reference issues,
and other difficult-to-find remnants.
 Retain cross-references over topic chunking.
 Automate process as much as possible to avoid disruption
 Deliver a user-friendly publishing solution that allows import of
variables and conditions
www.thecontentera.com 5 11/2/2017
Creative solutions for unique content
www.thecontentera.com 6 11/2/2017
Structured Application
Scripts
Read/Write Rules
Stylesheets
Categorize topics
 How we did it:
 Apply unique paragraph tag only to the headings of reference and
conceptual information
 Creative handling of the conversion table to convert every topic to the
strictest topic type: task
 Stylesheet to correctly categorize topics into concepts, references, and
tasks
www.thecontentera.com 7 11/2/2017
Minimize content rework before conversion
 Some “topics” contain multiple
information types in the unstructured
environment
 How we did it:
 Convert procedures by nesting tasks to
generate valid XML upon conversion
 Let each authoring team define the
schedules for “untangling” content,
such as removing conceptual information
from procedures
www.thecontentera.com 8 11/2/2017
Retain some formatting items
 Some formatting items were
important to retain, such as table
alignment and styles
 How we did it:
 We retained table alignment by
wrapping the table element in a
p element, and using the p element
to drive alignment
www.thecontentera.com 9 11/2/2017
Retain some unstructured FM features
 Since our approach is a phased one, we need to retain variables and
conditional text in the early phases.
 How we did it:
 Use FrameMaker conditions as in
unstructured FrameMaker
 Wrap variables in ph element
and continue to use the FrameMaker
Variables dialog as in unstructured
FrameMaker
 Publishing solution imports variables
and conditions across the publication
www.thecontentera.com 10 11/2/2017
Address legacy content: character formats
 Character formats are sometimes incorrectly applied (do not use
character format tags)
 How we did it:
 Created script to search for incorrectly applied character formats and
capture these in the conversion table
www.thecontentera.com 11 11/2/2017
Address legacy content: formatting tags
 Many paragraph tags in the unstructured template are used solely
for formatting, and therefore need to be deleted
 How we did it:
 Created script to search for and delete formatting paragraph tags
www.thecontentera.com 12 11/2/2017
Address legacy content: X-ref issues
 Common problems with cross-references in legacy content include:
 Multiple paragraphs with identical X-Ref marker. This occurs because
new chapters were created from existing chapters (h1 X-Ref marker
duplicated when new chapter created).
 Paragraphs with more than one X-Ref marker. This occurs frequently
because index markers are mistakenly inserted as X-Ref markers.
 How we did it:
 Created script to add the chapter name as the prefix to every cross-
reference marker. This ensures X-Ref markers are unique.
 Created script to check for and fix paragraphs with multiple X-Ref
markers.
www.thecontentera.com 13 11/2/2017
Address legacy content: graphics
 Many legacy graphics include
multiple (and sometimes
hidden) frames
 How we did it:
 Created script to check every
anchored frame for composite
graphics
www.thecontentera.com 14 11/2/2017
Retain X-Refs over topic chunking
 How we did it:
 Merge book and then fix resulting unresolved cross-references
 Fix legacy cross-reference issues
 Created script to generate the correct link format for cross-references
(append idref to the href following # symbol)
www.thecontentera.com 15 11/2/2017
Automate process
 How we did it:
 Created a user-friendly workflow by:
 Allowing authors to run scripts from custom built menu in FrameMaker
 Using different structured applications during conversion and authoring
 Using Read/Write rules to roundtrip XML
 Using stylesheets to adjust structure where necessary
www.thecontentera.com 16 11/2/2017
Publishing solution
 How we did it:
 Leveraged and extended the out-of-the-box FrameMaker publishing
solution
 Used ditafm-output.ini file to define PDF publishing parameters
 Built FM book from ditamap
 Created script to import variables and conditional text settings from a
predefined “control” file (saved in the same folder as the ditamap)
 Generated book also used to output responsive HTML5 (using the
FrameMaker Publish dialog)
www.thecontentera.com 17 11/2/2017

More Related Content

Similar to LavaCon 2017 - Implementing a Customer Driven Transition to DITA - Part 1 (20)

PDF
The Trip to DITA
ClearPath, LLC
 
PDF
Structured Authoring for Business-Critical Content
LavaCon
 
PDF
Engineering Web Content (Web Content 2009)
Joe Gollner
 
PDF
Introduction to Content Engineering
Joe Gollner
 
PDF
LavaCon 2017 - Implementing a Customer-driven Transition to DITA Content: A S...
Jack Molisani
 
PPTX
Joining the structured highway: from first gear to the fast lane
Dominic Doherty
 
PDF
From Zero to DITA in about 60 Minutes
Publishing Smarter
 
PDF
LavaCon 2017 - Building an Enterprisewide Content Platform—and Why DITA will ...
Jack Molisani
 
PPTX
The Road to DITA
Wendy Shaffer
 
PPTX
Moving unstructured FrameMaker content to structure
Publishing Smarter
 
PPTX
Structured authoring for business-critical content
Jason Aiken
 
PPTX
Best practices when migrating to DITA
Publishing Smarter
 
PPTX
DITA Surprise, Unwrapping DITA Best Practices - tekom tcworld 2016
IXIASOFT
 
PPTX
Easy steps to convert your content to structured (frame maker and xml)
Publishing Smarter
 
PDF
Graat - Migrating to Structured Authoring
LavaCon
 
PPTX
STC Austin: Best practices when migrating to DITA
Publishing Smarter
 
PPTX
DITA Quick Start Webinar Series: Building a Project Plan
Suite Solutions
 
PPTX
DITA Quick Start Webinar Series: Building a Project Plan
Suite Solutions
 
PDF
LavaCon 2017 - Implementing a Customer Driven Implementation to DITA - Part 2.
Jack Molisani
 
PDF
Serving authorsneeds
Julie Atkins
 
The Trip to DITA
ClearPath, LLC
 
Structured Authoring for Business-Critical Content
LavaCon
 
Engineering Web Content (Web Content 2009)
Joe Gollner
 
Introduction to Content Engineering
Joe Gollner
 
LavaCon 2017 - Implementing a Customer-driven Transition to DITA Content: A S...
Jack Molisani
 
Joining the structured highway: from first gear to the fast lane
Dominic Doherty
 
From Zero to DITA in about 60 Minutes
Publishing Smarter
 
LavaCon 2017 - Building an Enterprisewide Content Platform—and Why DITA will ...
Jack Molisani
 
The Road to DITA
Wendy Shaffer
 
Moving unstructured FrameMaker content to structure
Publishing Smarter
 
Structured authoring for business-critical content
Jason Aiken
 
Best practices when migrating to DITA
Publishing Smarter
 
DITA Surprise, Unwrapping DITA Best Practices - tekom tcworld 2016
IXIASOFT
 
Easy steps to convert your content to structured (frame maker and xml)
Publishing Smarter
 
Graat - Migrating to Structured Authoring
LavaCon
 
STC Austin: Best practices when migrating to DITA
Publishing Smarter
 
DITA Quick Start Webinar Series: Building a Project Plan
Suite Solutions
 
DITA Quick Start Webinar Series: Building a Project Plan
Suite Solutions
 
LavaCon 2017 - Implementing a Customer Driven Implementation to DITA - Part 2.
Jack Molisani
 
Serving authorsneeds
Julie Atkins
 

More from Jack Molisani (20)

PDF
Keith Schengili-Roberts - DITA Worst Practices
Jack Molisani
 
PDF
LavaCon 2017 - Developing Your Edge: Getting a Seat at the Customer’s Table
Jack Molisani
 
PDF
LavaCon 2017 - How Modern Analytics Will Turn Your Technical Content Into a R...
Jack Molisani
 
PPTX
LavaCon 2017 - Agile Localization: Building Bridges Between Translation Quali...
Jack Molisani
 
PDF
LavaCon 2017 - Who Cares About Change?
Jack Molisani
 
PDF
LavaCon 2017 - Much Ado About Templates: Reduce the Learning Curve and Increa...
Jack Molisani
 
PPTX
LavaCon 2017 - Take the Risk, Embrace the Change!
Jack Molisani
 
PDF
LavaCon 2017 - Structured Content Authoring For All!
Jack Molisani
 
PDF
LavaCon 2017 - Building Catwalks Between Silos: Using Taxonomy to Drive Engag...
Jack Molisani
 
PDF
LavaCon 2017 - Getting Dragged Along? Start Charting Your Team’s Course with ...
Jack Molisani
 
PPTX
LavaCon 2017 - DITA: Start Small, Grow Big Using Open Source Tools
Jack Molisani
 
PDF
LavaCon 2017 - Feed the Goldfish in 19 Minutes and 52 Seconds
Jack Molisani
 
PDF
LavaCon 2017 - How UX and Content Can (and Should) Work Together
Jack Molisani
 
PPTX
LavaCon 2017 - Evolving the New Content Order
Jack Molisani
 
PPTX
LavaCon 2017 - Managing Stakeholders Across the Content Ecosystem: The Key to...
Jack Molisani
 
PPTX
LavaCon 2017 - Future-proof Your Content: Beyond Traditional Publishing for S...
Jack Molisani
 
PPTX
LavaCon 2017 - Silos. (And other concepts that make us average)
Jack Molisani
 
PPTX
LavaCon 2017 - Management Workshop Part 1: Leadership and Management in Techn...
Jack Molisani
 
PPTX
LavaCon 2017 - Stop, Listen, and Collaborate: Creating an Experience-first Co...
Jack Molisani
 
PPTX
LavaCon 2017 - Engineering Content 4.0 for a Digital World
Jack Molisani
 
Keith Schengili-Roberts - DITA Worst Practices
Jack Molisani
 
LavaCon 2017 - Developing Your Edge: Getting a Seat at the Customer’s Table
Jack Molisani
 
LavaCon 2017 - How Modern Analytics Will Turn Your Technical Content Into a R...
Jack Molisani
 
LavaCon 2017 - Agile Localization: Building Bridges Between Translation Quali...
Jack Molisani
 
LavaCon 2017 - Who Cares About Change?
Jack Molisani
 
LavaCon 2017 - Much Ado About Templates: Reduce the Learning Curve and Increa...
Jack Molisani
 
LavaCon 2017 - Take the Risk, Embrace the Change!
Jack Molisani
 
LavaCon 2017 - Structured Content Authoring For All!
Jack Molisani
 
LavaCon 2017 - Building Catwalks Between Silos: Using Taxonomy to Drive Engag...
Jack Molisani
 
LavaCon 2017 - Getting Dragged Along? Start Charting Your Team’s Course with ...
Jack Molisani
 
LavaCon 2017 - DITA: Start Small, Grow Big Using Open Source Tools
Jack Molisani
 
LavaCon 2017 - Feed the Goldfish in 19 Minutes and 52 Seconds
Jack Molisani
 
LavaCon 2017 - How UX and Content Can (and Should) Work Together
Jack Molisani
 
LavaCon 2017 - Evolving the New Content Order
Jack Molisani
 
LavaCon 2017 - Managing Stakeholders Across the Content Ecosystem: The Key to...
Jack Molisani
 
LavaCon 2017 - Future-proof Your Content: Beyond Traditional Publishing for S...
Jack Molisani
 
LavaCon 2017 - Silos. (And other concepts that make us average)
Jack Molisani
 
LavaCon 2017 - Management Workshop Part 1: Leadership and Management in Techn...
Jack Molisani
 
LavaCon 2017 - Stop, Listen, and Collaborate: Creating an Experience-first Co...
Jack Molisani
 
LavaCon 2017 - Engineering Content 4.0 for a Digital World
Jack Molisani
 
Ad

Recently uploaded (20)

PDF
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
PDF
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
PDF
Kubernetes - Architecture & Components.pdf
geethak285
 
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
PDF
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
PPTX
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
PDF
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
PDF
TrustArc Webinar - Navigating APAC Data Privacy Laws: Compliance & Challenges
TrustArc
 
PDF
Why aren't you using FME Flow's CPU Time?
Safe Software
 
PPTX
Reimaginando la Ciberdefensa: De Copilots a Redes de Agentes
Cristian Garcia G.
 
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
PDF
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
PDF
Pipeline Industry IoT - Real Time Data Monitoring
Safe Software
 
PDF
Unlocking FME Flow’s Potential: Architecture Design for Modern Enterprises
Safe Software
 
PPTX
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
PPTX
Mastering Authorization: Integrating Authentication and Authorization Data in...
Hitachi, Ltd. OSS Solution Center.
 
PPTX
2025 HackRedCon Cyber Career Paths.pptx Scott Stanton
Scott Stanton
 
PDF
How to Comply With Saudi Arabia’s National Cybersecurity Regulations.pdf
Bluechip Advanced Technologies
 
PDF
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
Optimizing the trajectory of a wheel loader working in short loading cycles
Reno Filla
 
99 Bottles of Trust on the Wall — Operational Principles for Trust in Cyber C...
treyka
 
Kubernetes - Architecture & Components.pdf
geethak285
 
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
SANGHEE SHIN
 
Hyderabad MuleSoft In-Person Meetup (June 21, 2025) Slides
Ravi Tamada
 
Smarter Governance with AI: What Every Board Needs to Know
OnBoard
 
Bridging CAD, IBM TRIRIGA & GIS with FME: The Portland Public Schools Case
Safe Software
 
TrustArc Webinar - Navigating APAC Data Privacy Laws: Compliance & Challenges
TrustArc
 
Why aren't you using FME Flow's CPU Time?
Safe Software
 
Reimaginando la Ciberdefensa: De Copilots a Redes de Agentes
Cristian Garcia G.
 
5 Things to Consider When Deploying AI in Your Enterprise
Safe Software
 
Darley - FIRST Copenhagen Lightning Talk (2025-06-26) Epochalypse 2038 - Time...
treyka
 
Pipeline Industry IoT - Real Time Data Monitoring
Safe Software
 
Unlocking FME Flow’s Potential: Architecture Design for Modern Enterprises
Safe Software
 
Paycifi - Programmable Trust_Breakfast_PPTXT
FinTech Belgium
 
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Safe Software
 
Mastering Authorization: Integrating Authentication and Authorization Data in...
Hitachi, Ltd. OSS Solution Center.
 
2025 HackRedCon Cyber Career Paths.pptx Scott Stanton
Scott Stanton
 
How to Comply With Saudi Arabia’s National Cybersecurity Regulations.pdf
Bluechip Advanced Technologies
 
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
Ad

LavaCon 2017 - Implementing a Customer Driven Transition to DITA - Part 1

  • 1. Implementing a Customer-driven Transition to DITA Content A Step-by-Step Journey to Success www.thecontentera.com 111/2/2017
  • 2. President – Director Tech Comm – SVP Global Operations – Founder/CEO – Founder/CEO - Tom Aldous Email: [email protected] Blog: www.tmaldous.com YouTube: http://bit.ly/TheContentEra_YouTube www.thecontentera.com 2 11/2/2017 - 30 Years in Technical Communication - XML Expert - Certified FrameMaker Expert - Structured Authoring Wizard - DITA Specialization Expert - Content Strategist - Industry Influencer
  • 3. About Us  We love solving problems in real-world conditions.  We help you identify the tools and strategies that work best for your content. Then, we train you to use them effectively.  We can help you with structured content creation and management.  Our knowledge of content tools, markup languages, scripting/automation, and best practices reduces cycle time and increases efficiency.  We know your existing content is valuable: many of our projects involve conversions to structured content and between XML standards. www.thecontentera.com 3 11/2/2017
  • 4. Today’s goal and how we can help  Demonstrate one client’s unique issues and showcase the TCE solution.  Every company has a unique set of requirements and issues. Cookie- cutter solutions cannot address unique content.  Existing content rarely translates directly to the DITA standard - creative solutions built with XSLT, read-write rules and scripts prepare your content for the transition.  If you do not need to convert content and your solution is simple, we can train you to modify templates to extend out-of-box solutions. www.thecontentera.com 4 11/2/2017
  • 5. Challenges  Categorize topics as concept, reference, or task upon conversion.  Minimize content rework before conversion (such as untangling conceptual information from procedures).  Retain some formatting items, such as table alignment.  Retain some unstructured FrameMaker features in early phases to support a phased transition.  Address legacy content that inherently includes overrides, formatting tags, cross-reference issues, and other difficult-to-find remnants.  Retain cross-references over topic chunking.  Automate process as much as possible to avoid disruption  Deliver a user-friendly publishing solution that allows import of variables and conditions www.thecontentera.com 5 11/2/2017
  • 6. Creative solutions for unique content www.thecontentera.com 6 11/2/2017 Structured Application Scripts Read/Write Rules Stylesheets
  • 7. Categorize topics  How we did it:  Apply unique paragraph tag only to the headings of reference and conceptual information  Creative handling of the conversion table to convert every topic to the strictest topic type: task  Stylesheet to correctly categorize topics into concepts, references, and tasks www.thecontentera.com 7 11/2/2017
  • 8. Minimize content rework before conversion  Some “topics” contain multiple information types in the unstructured environment  How we did it:  Convert procedures by nesting tasks to generate valid XML upon conversion  Let each authoring team define the schedules for “untangling” content, such as removing conceptual information from procedures www.thecontentera.com 8 11/2/2017
  • 9. Retain some formatting items  Some formatting items were important to retain, such as table alignment and styles  How we did it:  We retained table alignment by wrapping the table element in a p element, and using the p element to drive alignment www.thecontentera.com 9 11/2/2017
  • 10. Retain some unstructured FM features  Since our approach is a phased one, we need to retain variables and conditional text in the early phases.  How we did it:  Use FrameMaker conditions as in unstructured FrameMaker  Wrap variables in ph element and continue to use the FrameMaker Variables dialog as in unstructured FrameMaker  Publishing solution imports variables and conditions across the publication www.thecontentera.com 10 11/2/2017
  • 11. Address legacy content: character formats  Character formats are sometimes incorrectly applied (do not use character format tags)  How we did it:  Created script to search for incorrectly applied character formats and capture these in the conversion table www.thecontentera.com 11 11/2/2017
  • 12. Address legacy content: formatting tags  Many paragraph tags in the unstructured template are used solely for formatting, and therefore need to be deleted  How we did it:  Created script to search for and delete formatting paragraph tags www.thecontentera.com 12 11/2/2017
  • 13. Address legacy content: X-ref issues  Common problems with cross-references in legacy content include:  Multiple paragraphs with identical X-Ref marker. This occurs because new chapters were created from existing chapters (h1 X-Ref marker duplicated when new chapter created).  Paragraphs with more than one X-Ref marker. This occurs frequently because index markers are mistakenly inserted as X-Ref markers.  How we did it:  Created script to add the chapter name as the prefix to every cross- reference marker. This ensures X-Ref markers are unique.  Created script to check for and fix paragraphs with multiple X-Ref markers. www.thecontentera.com 13 11/2/2017
  • 14. Address legacy content: graphics  Many legacy graphics include multiple (and sometimes hidden) frames  How we did it:  Created script to check every anchored frame for composite graphics www.thecontentera.com 14 11/2/2017
  • 15. Retain X-Refs over topic chunking  How we did it:  Merge book and then fix resulting unresolved cross-references  Fix legacy cross-reference issues  Created script to generate the correct link format for cross-references (append idref to the href following # symbol) www.thecontentera.com 15 11/2/2017
  • 16. Automate process  How we did it:  Created a user-friendly workflow by:  Allowing authors to run scripts from custom built menu in FrameMaker  Using different structured applications during conversion and authoring  Using Read/Write rules to roundtrip XML  Using stylesheets to adjust structure where necessary www.thecontentera.com 16 11/2/2017
  • 17. Publishing solution  How we did it:  Leveraged and extended the out-of-the-box FrameMaker publishing solution  Used ditafm-output.ini file to define PDF publishing parameters  Built FM book from ditamap  Created script to import variables and conditional text settings from a predefined “control” file (saved in the same folder as the ditamap)  Generated book also used to output responsive HTML5 (using the FrameMaker Publish dialog) www.thecontentera.com 17 11/2/2017