jump to navigation

Convert/Import from PDF and Keep the Formatting April 10, 2007

Posted by Carthik in applications, guides, microsoft, office, ubuntu.
trackback

I have often wanted to convert a PDF file to a MS Word (.doc) file or an openoffice.org file. Usually I just copy the text from the PDF file and paste it in the new word document. Soon, this gets pretty tiring.

Recently I found a way to convert a pdf file to other formats, including .doc and .odt which preserves the formatting of the text pretty well. It is not perfect preserved but it is way better than having no formatting at all.

The secret goes by the name KWord. KWord is a KDE application that has a pdf “import” feature which lets you import either entire pdf documents or just a few pages from a pdf document while preserving the formatting! Of course – this only works for pdf documents which are not scanned images of pages. I tried it out on files created using \LaTeX, MS Word and OpenOffice. The font sizes in the imported document are larger than they need to be, but at least the headings are heading, the normal text is normal text, and the bullets are bullets!

KWord - file menu
Start the import using the “File” -> “Import” option in the main KWord menu.

Kword - import pdf
After you select the pdf file to be imported, you will see a window like the one above where you can specify the pages you want to import. I did not change the default selected options – changing them and seeing what happens is an exercise left to the reader. :)

Of course, if you want to install KWord on your Ubuntu system, you can run the following command from the terminal window:
$sudo apt-get install kword

Then you can launch KWord using:
$KWord &
or by clicking the entry for KWord in the menu on your desktop.

Comments»

1. Dirk Gently - April 10, 2007

This sounds useful. I’ve been trying to find a way to preserve the text alignment in particular….as it seems to forget where it was, break the sentences where they should wrap and think everything should be “left”……is this a left wing conspiracy???? lolololol

2. .bootstrap - April 10, 2007

Convert/Import from PDF and Keep the Formatting

What a great thing – KWord from KDE Office seems to be the solution.

3. Nelson - April 10, 2007

Wow! Really useful! You’ve saved me a lot of work with this.

4. cenebris - April 10, 2007

This option should be implemented in OOo…

5. Ubuntu | Carthik Sharma: Convert/Import from PDF and Keep the Formatting - April 10, 2007

[...] (more&#823 Previous Post: Eric Lake: You aren’t running Ubuntu? » Next Post: Jerome S. Gotangco: Dapper needs an update » Related Reading: [...]

6. n8k99 - April 10, 2007

oh freakin’ fantastic! i have been using kword pretty exclusively for six months and _never_ even thought about such a thing much less tried it out! i’m gonna go convert all my pdfs to .do^W.odt right now.

ok, maybe not all of them.

BTW, thanks.

7. Eleftherios Kosmas - April 11, 2007

lifesaver… cenebris is wright “This option should be implemented in OOo…”

8. Marcus - April 11, 2007

ooh neat. thanks!

9. Dirk Gently - April 12, 2007

Anyone thought to let the OpenOffice peeps know about this feature, and request they add it in?

10. carthik - April 12, 2007

Dirk, so how’s you detective agency doing? ;)

11. Dirk Gently - April 12, 2007

The experiment on the cat is still in progress, although I suspect he may be dead…..judging by the smell. lol.

So far, only two people have gotten the “Dirk Gently” reference. lol. I am thinking about doing two “about me” pages…..one about me, the other about Dirk Gently, as created by the late genius known as Douglas Adams.

12. ArtInvent - April 12, 2007

I’ve often wondered why pdf isn’t openable and editable in every decent word processor. It’s been the de facto document exchange format for years, and we’re only now getting around to actually being able to edit it freely? Good grief. If a dinky little prog like KWord can seem to manage it, why can’t a behemoth like OOo?

13. ArtInvent - April 12, 2007

Not to slight KWord or anything. Looks like a pretty capable prog.

14. ManwyMania » Convierte o importa un documento PDF sin perder el formato - April 14, 2007

[...] Este artículo es una traducción libre no oficial de “Convert/Import from PDF and Keep the Formatting“. [...]

15. ibidnah - April 14, 2007

convert /import and keep pdf formatting and then there it is and i can edit. terrific appreciate the info and well enoughh written awith what to do and how to do it succinct. terrific.

16. ibidnah - April 14, 2007

alll-lrighttttteeee then.

17. sx4teryzd344nlw - April 25, 2007

Tried importing PDF into KWord on several occasions and the result is poor formatting and it will not import images from the PDF file. Not so great I would say!

18. Friday Hot Links– Week of 5/11 at sfp101: Stress-Free Productivity - May 11, 2007

[...] Importing PDF text and formating into documents on Linux Ubuntu Blog- Convert/Import from PDF and Keep the Formatting [...]

19. Importar desde PDF manteniendo el formato « EsUbuntu.org - May 13, 2007

[...] de Ubuntu Blog  No Comments Leave a Commenttrackback addressThere was an error with your comment, please try [...]

20. Ron Hitler Barassi - May 15, 2007

KWord is good if the PDF document you’re converting is text-only or has only one or two images. If the PDF document is more complex than that, you’re wasting your time with KWord. Thanks carthik anyways.

21. Wezp Directory - June 9, 2007

KWord is good if the PDF document you’re converting is text-only or has only one or two images.

22. addthe - June 20, 2007

great job, but 105mb, hmm my admin will punish me :P

23. Joaquim - August 2, 2007

“Of course – this only works for pdf documents which are not scanned images of pages.”… but that’s quite what I’m needing now, converting some scanned as images pages to text format (.odt or .doc might do.) anyone can give suggestions?

24. linuxloon - August 9, 2007

Thanks for the tip. Saved me a lot of work. I’m adding this to my personal Tip’s-n-Trick’s list.

25. Convert/Import from PDF and Keep the Formatting « Ubuntu Blog « Martin Schaaf’s Blog - April 1, 2008

[...] Convert/Import from PDF and Keep the Formatting « Ubuntu Blog Filed under: Misc — Tags: convert, import, kword, pdf, pdf convert, pdf import — mascha @ 14:44:22 +0000 Convert/Import from PDF and Keep the Formatting « Ubuntu Blog [...]

26. Andrew Ziem - June 6, 2008

cenebris, it is now! Try the PDF import extension for OpenOffice.org. It also supports hybrid PDFs.

27. mark - June 23, 2008

OK but for small size doc and yes…………txt only. what a waste! keep on searching

28. Sir Gee, - July 15, 2008

Unfortunatly, the PDF Import extension is ONLY for Ooo 3 (Beta) not the current release version.

29. aktik - November 24, 2008

What i must send you if you need health?,

30. Tudor - January 21, 2009

It installed along with other ~ 180 megabytes of libraries but it works like a charm, so it’s worth the waiting.

Thank you for your post

31. Roddo - February 8, 2009

hello there… i add a text written in Greek and what i get is a mixed up words and symbols ..Tried to change the Unicode but didn’t worked … Any advices will be very useful !! thanks a lot

32. Varsha Jaikumar - February 9, 2009

Hi there, great job! It is really useful indeed.
But do you have any suggestions for converting pdf files into office document, that also contains scanned pages, on a gnome desktop? I am using Ubuntu 8.1.
Thanks in advance.
And all the best. Keep up ur work.

33. k - June 26, 2009

Hello, excuse my ignorance, but could someone provide some advice as to how i install this software. from scratch please, on a PC with vista. Is that even possible?

Thanks in advance.

34. micman - July 25, 2009

thanks a lot :)

35. How to convert pdf to html on Ubuntu 9.04 « Computer Borders - October 26, 2009

[...] It was not bad but we were really far from what we were looking for. As second experiment (2) we tried to use Kword and we had (as html) text and pictures but there were many incongruences [...]

36. Emm - December 3, 2009

my Kword won’t import pdf files.. even more, they do not appear when exploring the folders from “import…” option.

Do I need a plugin or something? What’s wrong with my Kword?

37. davide - February 10, 2010

on ubuntu 9.10 i386 this function is not available :-( how can I fix it?

38. Sjoerd - March 25, 2010

Function no longer available?

39. emiliano - April 12, 2010

same here: ubuntu 9.10 Karmic, the function happens to be lo longer available

40. Christian - June 3, 2010

How to open .pdf in KWord? eventhough the Filter is All Supported File, I can’t see my .pdf file at my /home/myname/Downloads

41. Christian - June 3, 2010

What version of KWord is this? Please assist us!

42. sikiş izle - September 26, 2010

ooh neat. thanks!

43. antalya ilaçlama - September 27, 2010

What a great thing – KWord from KDE Office seems to be the solution.

44. porno sikiş - September 27, 2010

lifesaver… cenebris is wright “This option should be implemented in OOo…”

45. sex sikiş - September 28, 2010

KWord is good if the PDF document you’re converting is text-only or has only one or two images. If the PDF document is more complex than that, you’re wasting your time with KWord. Thanks carthik anyways.

46. john phillips - October 23, 2010

KWord is not good for importing .pdf files

47. D Chatterjee - November 26, 2010

Hi Carthik. I use Ubuntu 10.10 and stumbled upon this post. I have just now installed KWord through Synaptic. It installed ok, but when I follow your instructions, I find that under file -> import, there is no option listed to open pdf filed. That is, the “file type” doesn’t list pdf, and hence in the folder, it doesn’t show up any pdf docs. Any suggestions? Thanks.

48. busybee - December 27, 2010

it doenst work anymore with kword:-( why has this import-option for pdf fils been removed?!?!

49. izmir escort - January 7, 2011

my Kword won’t import pdf files.. even more, they do not appear when exploring the folders from “import…” option.

50. alphaCMa - March 8, 2011

It doesn’t work for me neither…
I can’t import any pdf files, this function dosn’t exist… why, why?

51. Bill Davis - March 31, 2011

I am sorry to report that KWord 2.1.2 on Ubuntu 10.04 lucid no longer will import PDF files, even if you rename them to something other than PDF.

52. fiona - May 18, 2011

Bill Davis: Is that true?

I haven’t had a try with it. It’s difficult to import PDF especially import pdf to word for further editing. As fas as I know you need to find a good pdf converter or editor.

53. ibs 2011 link - November 25, 2011

if you have trouble with ibs try the link below trying to work out cures for the deaded ibs irritable bowl syndrome 2012.

ibs blog help click link

54. Nackte Frauen - March 2, 2013

What’s up, yes this piece of writing is truly fastidious and I have learned lot of things from it on the topic of blogging. thanks.

55. http://Tinyurl.com/blogmason56562 - April 13, 2013

I personally needed to share this specific blog post, “Convert/Import
from PDF and Keep the Formatting | Ubuntu Blog” along with my
best friends on twitter. I actuallymerely wished to pass on your remarkable publishing!
With thanks, Shenna

56. webpage - October 25, 2013

I actually have a tendency to agree with every aspect
that ended up being composed in “Convert/Import from PDF and Keep the Formatting | Ubuntu Blog”.

I am grateful for all the details.I appreciate it,Brianna

57. air conditioning adelaide - May 25, 2014

air conditioning adelaide

Convert/Import from PDF and Keep the Formatting | Ubuntu Blog

58. Jual Alat Bantu Sex Murah - June 23, 2014

Pretty component to content. I just stumbled upon your site and in accession capital to say that I acquire actually enjoyed account your weblog posts.
Anyway I will be subscribing in your feeds or even I success you access persistently rapidly.

59. discount seiko watches australia - July 5, 2014

They can be found in different colors, shapes, styles and structures that would surely fit you personal taste.

Once he gets his watch safely in his face
to face successful delivery, they can pay for it.
In the late nineteenth century, Bahrain had already be a prominent trade
hub to the region and started attracting attention from investors.

60. easy-reglement.com - August 21, 2014

Each new age brought the criminal element forward with it.
They only want your money and they want as much of it as they can possibly get.
Make certain that the payment gateway you are making
use of, permits similar languages as the remaining of the web pages so that they can match well
together.

61. Louis Vuitton Handbags - September 8, 2014

Everything is very open with a very clear description of the issues.
It was truly informative. Your website is very useful.
Many thanks for sharing!


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

Join 532 other followers

%d bloggers like this: