thai-language.comInternet resource
for the Thai language
Lookup:
» more options here
Browse

F.A.Q. Check out the list of frequently asked questions for a quick answer to your inquiry

e-mail the author
guestbook
site settings
site news
bulk lookup
Bangkok
Thanks for your

recent donations!

Narisa N. $+++!
John A. $+++!
Paul S. $100!
Mike A. $100!
Eric B. $100!
John Karl L. $100!
Don S. $100!
John S. $100!
Peter B. $100!
Ingo B $50
Peter d C $50
Hans G $50
Alan M. $50
Rod S. $50
Wolfgang W. $50
Bill O. $70
Ravinder S. $20
Chris S. $15
Jose D-C $20
Steven P. $20
Daniel W. $75
Rudolf M. $30
David R. $50
Judith W. $50
Roger C. $50
Steve D. $50
Sean F. $50
Paul G. B. $50
xsinventory $20
Nigel A. $15
Michael B. $20
Otto S. $20
Damien G. $12
Simon G. $5
Lindsay D. $25
David S. $25
Laurent L. $40
Peter van G. $10
Graham S. $10
Peter N. $30
James A. $10
Dmitry I. $10
Edward R. $50
Roderick S. $30
Mason S. $5
Henning E. $20
John F. $20
Daniel F. $10
Armand H. $20
Daniel S. $20
James McD. $20
Shane McC. $10
Roberto P. $50
Derrell P. $20
Trevor O. $30
Patrick H. $25
Rick @SS $15
Gene H. $10
Aye A. M. $33
S. Cummings $25
Will F. $20
Get e-mail

Sign-up to join our mail­ing list. You'll receive e­mail notification when this site is updated. Your privacy is guaran­teed; this list is not sold, shared, or used for any other purpose. Click here for more infor­mation.

To unsubscribe, click here.

PDF copy and paste to Word

Usage questions, bug reports, and feature requests for this web site

Moderator: acloudmovingby

PDF copy and paste to Word

Postby John » Sat Jan 12, 2008 3:57 pm

Hi,

An ongoing frustration for me has been the inability to copy Thai text from a PDF file (using Acrobat Reader) and paste to a Word document. The text I am copying has both English and Thai. The English pastes fine but the Thai comes out as symbols (i.e. "Aπ—s ◊O¬OAaπ oŸa‡¢’¬ πAπ—s ◊O¬")

Anyone have a solution?

John
John (จอห์น)
User avatar
John
 
Posts: 728
Joined: Mon Feb 14, 2005 12:00 am
Location: Rescue, CA USA

PDF copy and paste to Word

Postby David and Bui » Sat Jan 12, 2008 5:07 pm

Let me try:

มีผู้แนะนำ สั่งสอน ที่ปรึกษา เพื่อนที่คบหา
และบุคคลผู้แวดล้อมที่ดี, ความรู้จักเลือกเสวนาบุคคล

กัลยาณมิตตตา (ความมีกัลยาณมิตร คือ มีผู้แนะนำ สั่งสอน ที่ปรึกษา เพื่อนที่คบหา
และบุคคลผู้แวดล้อมที่ดี, ความรู้จักเลือกเสวนาบุคคล หรือเข้าร่วมหมู่กับท่านผู้ทรงคุณทรง
ปัญญามีความสามาFรถ ซึ่งจะช่วยแวดล้อม สนับสนุน ชักจูง ชี้ช่องทาง เป็นแบบอย่าง ตลอด
จนเป็นเครื่องอุดหนุนเกื้อกูลแก่กัน ให้ดำ เนินก้าวหน้าไปด้วยดี ในการศึกษาอบรม การครอง
ชีวิต การประกอบกิจการ และธรรมปฏิบัติ, สิ่งแวดล้อมทางสังคมที่ดี — Kalyànamittatà:
having good friends; good company; friendship with the lovely; favourable social
environment) ข้อนี้เป็น องค์ประกอบภายนอก (external factor; environmental factor)
David in Houston
David and Bui
 
Posts: 6183
Joined: Sat May 14, 2005 12:00 am
Location: Houston, Texas

PDF copy and paste to Word

Postby David and Bui » Sat Jan 12, 2008 5:10 pm

The first sentence above is from pdf directly to Firefox. The second was from pdf to Word 2007 to Firefox. Both are now visible in the previous post. Can you see the Thai?
David in Houston
David and Bui
 
Posts: 6183
Joined: Sat May 14, 2005 12:00 am
Location: Houston, Texas

PDF copy and paste to Word

Postby John » Sat Jan 12, 2008 5:16 pm

David,

I can see the text you posted in Thai.

Do you think there is there a Windows XP setting that is messing me up?

John
John (จอห์น)
User avatar
John
 
Posts: 728
Joined: Mon Feb 14, 2005 12:00 am
Location: Rescue, CA USA

PDF copy and paste to Word

Postby David and Bui » Sat Jan 12, 2008 5:28 pm

Many's the slip between cup and lip.

Lot's of bad things can happen in converting one format to another. Here are some of what I can guess at:

1. The pdf Thai sentence that you are trying to port from pdf to Word is only a picture, not text. If the Thai works fine in Word, then this is not the problem.

2. Try to paste the pdf-copied text into notepad and save as a text file. If the Thai looks o.k. then Word might be the problem.

3. In Word, do you have any other difficulties with Thai? In Word's "Font" function, can you read the Thai script setup in the "complex script" function? There should be a "preview box" below the complex script area which shows you a sample. Can you read the Thai in that context?

4. Try copying and pasting directly from pdf to Internet Explorer or Firefox and see if the Thai comes out correctly. If so, the problem would seem isolated to Word.

Good luck.
David in Houston
David and Bui
 
Posts: 6183
Joined: Sat May 14, 2005 12:00 am
Location: Houston, Texas

PDF copy and paste to Word

Postby John » Sat Jan 12, 2008 5:46 pm

David,

1. Is it possible that the PDF text is a "picture" and not actual text to protect a copyright?

2. Copying into other programs hasn't helped.

3. Word and other Microsoft products handle the Thai script just fine. I have many documents I have produced in Thai.

4. I don't know how to paste into Firefox or IE unless you are talking about pasteing into a lookup box.

Do you have a PDF file you could refer me to or sent me? Maybe my source file is the issue.

Thanks,

John
John (จอห์น)
User avatar
John
 
Posts: 728
Joined: Mon Feb 14, 2005 12:00 am
Location: Rescue, CA USA

PDF copy and paste to Word

Postby David and Bui » Sat Jan 12, 2008 7:11 pm

John, see PM.
David in Houston
David and Bui
 
Posts: 6183
Joined: Sat May 14, 2005 12:00 am
Location: Houston, Texas

PDF copy and paste to Word

Postby John » Sun Jan 13, 2008 7:51 am

David,

Thank you so much for your assistance. The PDF text you sent cuts and pastes fine. The Thai text I am using must be copy protected.

John
John (จอห์น)
User avatar
John
 
Posts: 728
Joined: Mon Feb 14, 2005 12:00 am
Location: Rescue, CA USA

Re: PDF copy and paste to Word

Postby glotcha » Sat Jun 29, 2019 5:45 am

I had had this problem after quite a bit of trying I found this post that explains exactly why it's not possible to copy and paste thai characters from certain PDF files.

https://superuser.com/questions/92615/c ... f-document

"Normal PDF documents containing Unicode text do not store the text as characters - but as references to the glyphs (letter shapes) in the fonts used."

So the strange characters you see when copying are not some kind of text encoding, as I originally suspected, but a code internal to the pdf document which tell the reader which character to display. If the embedded fonts do not have "translate to unicode" support, it is not possible for the user to copy the thai characters seen on the screen.
glotcha
 
Posts: 1
Joined: Thu Jun 27, 2019 7:34 pm


Return to thai-language.com: Issues, Problems, Suggestions

Who is online

Users browsing this forum: No registered users and 3 guests

Copyright © 2019 thai-language.com. Portions copyright © by original authors, rights reserved, used by permission; Portions 17 USC §107.