effectively using pdf as source
TRANSCRIPT
![Page 1: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/1.jpg)
USING PDF AS SOURCE
Liz Roscovius @LizRoscovius
1
#stc16
Mike Sawyer @Akambe
![Page 2: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/2.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
About us
Mike Sawyer @Akambe
Liz Roscovius @LizRoscovius
2
www.sawyerhome.net/stc2016
![Page 3: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/3.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
About you
3
![Page 4: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/4.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
4
Overview/Scope
• Explain PDFs and challenges using as source
• Getting file creation and property info • Exporting and importing entire PDF • Extracting and tweaking text and graphics • Formatting tips • Q&A
![Page 5: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/5.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
Why are PDFs still around?
5
• Portable Document Format
• Flexible display • Size-efficient • Self-contained • Available
![Page 6: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/6.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
6
• Limited edits • Explaining editability • Determining source
file format • Unlocking • Extracting usable/
editable text & graphics
Challenges
![Page 7: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/7.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
7
• Familiarity with Acrobat and DTP
• Full version of Acrobat • Unlocked PDFs • Keystrokes in examples
will be PC/Windows
Assumptions
![Page 8: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/8.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
8
• Restricts viewing, editing, copying, printing
• Original author can provide password
• Circumvent: PostScript, online tools
Password protection
![Page 9: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/9.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
9
Do you need to extract anything at all? • Latest Acrobat = Awesome tools • Change text (font embed) • Format text and paragraphs • Add text and graphics • Add headers, footers, watermarks
(including pagination)
Editing in place
![Page 10: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/10.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
10
• Author • Creation &
modification dates
• Software used
Collecting file information
![Page 11: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/11.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
11
• Security type • Restrictions
• Printing • Changing • Copying
Collecting file information
![Page 12: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/12.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
12
• Font usage • Match original
Collecting file information
![Page 13: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/13.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
13
• Layout • Line &
column breaks
• Tables • “A place for
everything”
Exporting problems
BEFORE AFTER
![Page 14: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/14.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
14
• File > Export To… • Word & RTF • Spreadsheet • Image • HTML & XML • Plain text
Exporting entire document
![Page 15: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/15.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
15
• Word (File > Open)
• CorelDraw (File > Import > AI > Text or Curves)
Importing entire document
![Page 16: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/16.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
16
Extracting editable text • Click & drag/
Control+A, then copy
• Pros & cons • Tagged info
![Page 17: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/17.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
17
Extracting non-editable text • Re-type • Export as graphic, run OCR • On-the-fly OCR in Acrobat
![Page 18: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/18.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
18
Avoiding the “extras” • Headers, footers,
pagination • Just crop them out
![Page 19: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/19.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
19
Cleaning up extracted text • Line breaks, hard
returns, extra spaces • Search/replace, esp.
character codes (InDesign: GREP)
• Tools: TextSoap, Textfixer, RecoSoft, PDF2ID
![Page 20: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/20.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
20
Graphics: Vector vs. raster • Understanding the differences
VECTOR RASTER
![Page 21: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/21.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
21
Extracting usable graphics • Convert to
Word/RTF • Open in drawing
program (Illustrator) • Other programs
![Page 22: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/22.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
22
Extracting usable graphics • Export from
Acrobat • When all else
fails: Zoom in and screen capture
![Page 23: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/23.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
23
Converting raster to vector • Open raster
image in Illustrator
• Window > Image Trace > [select preset/mode]
![Page 24: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/24.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
24
Recap • Limited editing but still usable • Takes some trial & error • Patience & perseverance
![Page 25: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/25.jpg)
#stc16 | USING PDF AS SOURCE | @LizRoscovius & @Akambe
25
Resources • www.sawyerhome.net/stc2016 • bit.ly/pdfsource • The difference between raster and vector • Tracing line art in Adobe Illustrator • Using the trace tool in Corel Draw 14 • Exporting PDFs to Microsoft Office formats • Acrobat help: converting PDFs to other file formats • Tips for cleaning up imported text • Using GREP to clean up imported text
![Page 26: Effectively Using PDF as Source](https://reader034.vdocuments.mx/reader034/viewer/2022042706/5882028d1a28abf05e8b4f5f/html5/thumbnails/26.jpg)
Mike Sawyer @Akambe [email protected]
CONTACT US: Liz Roscovius @LizRoscovius [email protected]
26
QUESTIONS? USING PDF AS SOURCE