ACM Symposium on Document Engineering 2007: Winnipeg, Manitoba, Canada
Peter R. King, Steven J. Simske (Eds.): Proceedings of the 2007 ACM Symposium on Document Engineering, Winnipeg, Manitoba, Canada, August 28-31, 2007. ACM 2007 ISBN 978-1-59593-776-6
Working session
Ethan V. Munson: Document engineering education. 1
Keynote address
Margaret-Anne D. Storey: Navigating documents using ontologies, taxonomies and folksonomies. 2
Paper documents: capture and physical-digital-coexitence
Shijian Lu, Chew Lim Tan: Thresholding of badly illuminated document images through photometric correction. 3-8
Weihua Huang, Chew Lim Tan: A system for understanding imaged infographics and its applications. 9-18
Nadir Weibel, Moira C. Norrie, Beat Signer: A model for mapping between printed and digital document instances. 19-28
Kosuke Konishi, Naohiro Furukawa, Hisashi Ikeda: Data model and architecture of a paper-digital document management system. 29-31
Carlos A. B. Mello: A new Tsallis entropy-based thresholding algorithm for images of historical documents. 32-34
Poster session
Variable data printing
Fabio Giannetti: A multi-format variable data template wrapper extending podis PPML-T standard. 37-43
Steven R. Bagley, David F. Brailsford, James A. Ollis: Extracting reusable document components for variable data printing. 44-52
Royston Sellman: VDP templates with theme-driven layer variants. 53-55
Alexander J. Macdonald, David F. Brailsford, Steven R. Bagley, John Lumley: Speculative document evaluation. 56-58
XML documents
Seung Min Kim, Suk I. Yoo, Eunji Hong, Tae Gwon Kim, Il Kon Kim: A document object modeling method to retrieve data from a very large XML document. 59-68
Gersende Georg, Marie-Christine Jaulent: A document engineering environment for clinical guidelines. 69-78
Deise de Brum Saccol, Nina Edelweiss, Renata de Matos Galante, Carlo Zaniolo: XML version detection. 79-88
Keynote address
Sara Church: Bank notes: extreme doceng. 92
Demonstrations
Fabio Giannetti: Anvil next generation: a multi-format variable data printtemplate based on PPML-T. 93-94
Ludovic Gaillard, Marc Nanard, Peter R. King, Jocelyne Nanard: Intention driven multimedia document production. 95-96
Fabrice Matulic: Touch scan-n-search: a touchscreen interface to retrieve online versions of scanned documents. 97-98
Tudor Groza, Alexander Schutz, Siegfried Handschuh: The salt triple: framework editor publisher. 99-100
Multimedia
Dick C. A. Bulterman, A. J. Jansen, Pablo César, Samuel Cruz-Lara: An efficient, streamable text format for multimedia captions and subtitles. 101-110
Marc Nanard, Jocelyne Nanard, Peter R. King, Ludovic Gaillard: Genre driven multimedia document production by means of incremental transformation. 111-120
Cyril Concolato, Jean Le Feuvre, Jean-Claude Moissinac: Timed-fragmentation of SVG documents to control the playback memory usage. 121-124
Layout and aesthetics
Kim Marriott, Peter Moulder, Nathan Hurst: Automatic float placement in multi-column documents. 125-134
Hervé Déjean, Jean-Luc Meunier: Logical document conversion: combining functional and formal knowledge. 135-143
Hui Chao, Prasad Gabbur, Anthony Wiley: Preserving the aesthetics during non-fixed aspect ratio scaling of the digital border. 144-146
Extending document engineering formats

Matthew R. B. Hardy: The Mars project: PDF in XML. 161-170
Tudor Groza, Alexander Schutz, Siegfried Handschuh: SALT: a semantic approach for generating document representations. 171-173
John Lumley, Roger Gimson, Owen Rees: Endless documents: a publication as a continual function. 174-176
Classification and machine learning
Michael G. Noll, Christoph Meinel: Authors vs. readers: a comparative study of document metadata and content in the www. 177-186
Eunyee Koh, Daniel Caruso, Andruid Kerne, Ricardo Gutierrez-Osuna: Elimination of junk document surrogate candidates through pattern recognition. 187-195
Tun Thura Thet, Jin-Cheon Na, Christopher S. G. Khoo: Filtering product reviews from web search results. 196-198
Jie Zou, Daniel X. Le, George R. Thoma: Structure and content analysis for html medical articles: a hidden markov model approach. 199-201
Nadia Zerida, Nadine Lucas, Bruno Crémilleux: Exclusion-inclusion based text categorization of biomedical articles. 202-204
Baoli Li, Neha Sugandh, Ernest V. Garcia, Ashwin Ram: Adapting associative classification to text categorization. 205-208
Document transformation
Thomas Triebsees, Uwe M. Borghoff: Towards automatic document migration: semantic preservation of embedded queries. 209-218
Catherine Pugin, Rolf Ingold: Combination of transformation and schema languages described by a complete formal semantics. 222-224



