|
Update: May 21, 2008
New features: support for Dragon Naturally Speaking; audio processing (such as cut, silence, insert audio); and many editing feature enhancements.
What is the IBM Caption Editing System?
The amount of digital media containing audio is increasing rapidly, yet most of these clips contain inadequate captions and cause accessibility problems. Transcibing from audio is difficult, especially in languages such as Japanese or Chinese that use complex characters. Consequently, there is a demand for an easy and inexpensive captioning method.
Although voice recognition software can convert audio into text, it contains enough errors to make the captioning of digital media files undesirable. Correcting errors from a regular text editor and audio player is quite time-consuming. The IBM® Caption Editing System (CES) helps correct these errors quickly and easily.
Key concepts deployed in this CES Editing System are a Master Client Editing System; full synchronization with Powerpoint presentation slideshow; mouse-free operation; complete audio synchronization; and audio processing.
This package comes with the caption editing system, sample files, and all documentation for evaluating the system.
How does it work?
The IBM Caption Editing System consists of three subsystems: Recorder, Master Editor, and Client Editor. The basic steps for using CES are to record the content (with or without Powerpoint Slides) and then edit the errors from Voice Recognition.
Initially, CES Recorder is used to generate caption candidates from voice input (or file input) by encapsulated IBM ViaVoice® 10/10.5 or Dragon Naturally Speaking 9. Many people use the presentation slideshow when making presentations. If Microsoft Powerpoint is running as slideshow when recording, CES Recorder captures the screen and derives the text in each page. The user automatically has content with audio,slideshow, and caption (with some errors).
The next step is to use the CES Editor to edit the errors in the caption candidate for a complete caption. If there is only one editor, CES Master Editing System may be used to edit the caption. However, if multiple editors are available, it is possible to use the CES Master Client Editing System to effectively put together the work. The CES Master Client Editing System consists of one Master Editing System and a single or multiple Client Editing Systems connected over a LAN or FTP network. Client Editing is easy to use, so anyone without deep knowledge of the system can participate. In both systems, users can customize how the audio is automatically played.
The CES Master Editing System is rich in functionality. Microsoft Powerpoint presentation text, derived by the CES Recorder, can be effectively used for editing. If the user has a transcript available, he can use transcript matching to correct the corresponding caption. Furthermore, the system allows the user to edit the content layout.
|