+ Reply to Thread
Results 1 to 27 of 27

Paragraphing from Text

  1. #1
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Paragraphing from Text

    I have some PDF books/stories that weren't formatted well, so that the paragraphs are stacked on top of each other instead of having a line between them. I want to print some of these, and make them as readable as possible.

    If I copy from PDF and paste into Word, or convert from PDF to Word, is there a way to separate each paragraph by putting a line beneath it, separating it from the next? An example is below.

    This is how it looks now:
    Please Login or Register  to view this content.
    This is way I would like it:
    Please Login or Register  to view this content.

  2. #2
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    Please Login or Register  to view this content.
    Last edited by snb; 10-08-2011 at 09:15 AM.



  3. #3
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    SNB,

    The code didn't work at first, then I discovered ThisDocument was mispelled. However, even after correcting for that, running the code produced no effect.

    I've attached a sample I'm trying with.
    Attached Files Attached Files
    Last edited by jomili; 10-07-2011 at 05:03 PM.

  4. #4
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    It works here in your file.

  5. #5
    Forum Guru
    Join Date
    03-12-2010
    Location
    Canada
    MS-Off Ver
    2010 and 2013
    Posts
    4,418

    Re: Paragraphing from Text

    Maybe replace thisdocument with activedocument, but this won't give you the result you are looking for. When you convert or import PDFs into Word it assumes that every line in PDF file is a paragraph on its own. Therefore when you run this code, all it will do is create the impression of double-spacing. You can modify the code that snb provided to replace the paragraph marks with a space and then it will all be merged into one paragraph and then manually add in the correct paragraph marks. I don't know of any way to automate this from a PDF file to a Word document.

    Hope this helps.

    abousetta
    Please consider:

    Thanking those who helped you. Click the star icon in the lower left part of the contributor's post and add Reputation.
    Cleaning up when you're done. Mark your thread [SOLVED] if you received your answer.

  6. #6
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Hmm...

    SNB, I found my problem with your code; I had it in a module in NORMAL. When I moved it to a module for my document, it worked. However, it worked the way Abousetta described, creating the impression of double-spacing, not paragraphing as intended.

    Would there be a way to do it based on line-length? I'm no expert in Word, but isn't there a limit to the number of characters that can exist in a line (determined by font and margins). So, if a line is less than, say, 95% of the max characters, we can assume that's a paragraph end, and plug in a paragraph mark?

    I'm sure I'm not the only person who's run into this problem. If Word can't do it, do you know of a non-Word based approach that might work?

  7. #7
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Still desiring a solution.

  8. #8
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    What about checking the first character of each paragraph ?
    If it's lowercase than it's part of a paragraph, not a new one.

  9. #9
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    That would probably straighten it up a lot, but it wouldn't catch the ones where the line begins with a capital letter.

    Going with my earlier thought, could I copy the text into Excel, and somehow get each line into a cell of it's own, then do LEN to determine length, plug in a symbol (say, a tilde) in the cell to the right of the cells with the least characters, assuming those are paragraph endings, then paste back into Word and replace the tildes with a paragraph symbol?

    I'm definitely in favor of an easier solution if it's out there.

  10. #10
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    It wouldn't just straitghten up a 'lot', but probably 99 %.
    Reason enough to do this.
    Your Excel 'solution' won't bring you closer.
    Do not skip to Excel because of your unfamiliarity with Word. Word's VBA is much more capable to handle texts than Excel's.
    Last edited by snb; 11-11-2011 at 05:17 AM.

  11. #11
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    I'm all in favor of using Word's VBA, but am completely lost here.

    So, how would we test the first character in each paragraph? I'm not even familiar enough with Word's VBa to get started.

  12. #12
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    Use VBA's intellisense.
    Start typing Activedocument. in the VBEditor

  13. #13
    Forum Guru
    Join Date
    03-02-2006
    Location
    Los Angeles, Ca
    MS-Off Ver
    WinXP/MSO2007;Win10/MSO2016
    Posts
    12,941

    Re: Paragraphing from Text

    Search for the period and paragraph marker:
    Please Login or Register  to view this content.
    Ben Van Johnson

  14. #14
    Forum Guru
    Join Date
    03-12-2010
    Location
    Canada
    MS-Off Ver
    2010 and 2013
    Posts
    4,418

    Re: Paragraphing from Text

    Here is my humble attempt at combining the codes from snb and protonLeah

    Please Login or Register  to view this content.
    The end result will look something like:

    Please Login or Register  to view this content.
    Hope this helps.

    abousetta
    Last edited by abousetta; 11-15-2011 at 04:43 AM. Reason: Added period to code

  15. #15
    Forum Expert
    Join Date
    06-09-2010
    Location
    Australia
    MS-Off Ver
    Excel 2013
    Posts
    1,714

    Re: Paragraphing from Text

    Hi
    I use this code in Word to quickly reformat imported text where lines import as paragraphs. It:
    • replaces paragraph marks with spaces where they precede a word that does not begin with a capital letter (note this removes breaks intended to precede lower case letters, eg dot points. You might want to sort those out separately)
    • Removes paragraph breaks between commonly capitalised words and phrases
    • removes para breaks after words unlikely to precede a para break (such as 'and')
    you might want to save your file before you try it!

    Please Login or Register  to view this content.

  16. #16
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    or
    Please Login or Register  to view this content.

  17. #17
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Sorry it's taken me so long to get back to this.

    ProtonLeah, I tried your code, and it did an excellent job. The only thing it didn't seem to catch was paragraphs that end in quotation marks.

    Abousetta, wow! I tried your code, and again the only problem was with quotation marks. See the paragraph below for a sample.
    Please Login or Register  to view this content.
    SNB and NickyC, I also tried your codes. They didn't seem to do anything noticeable, though Nicky's caused a lot of furious whirring before ending up with what looked to all intents and purposes like the original document I started with. I've attached the document I'm trying all of these macros on so you can tell me if I've done something wrong and can see the same results I'm seeing.
    Attached Files Attached Files

  18. #18
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    Please Login or Register  to view this content.
    PS. to see what has happened you should make all 'invisible' characters visible. (tabs, spaces, paragraph signs, etc.)
    Last edited by snb; 11-15-2011 at 11:43 AM.

  19. #19
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Thanks SNB; good idea on making the invisible visible.

    Your latest rendition does a great job, but still having trouble with quotations. I don't think they'll ever be 100%, but I think we can get closer. See the text below:
    Please Login or Register  to view this content.
    Notice that we have some back-to-back quotation marks? If we have two quotation marks together, chances are good that we should have a paragraph break between them. Also, quotation marks usually travel in pairs; where there's one, there should be a second, with no paragraph break between. So, the paragraph break following "mother" shouldn't be there. I'm not understanding how your macro works, so don't know how to tweak it for these situations. I'd appreciate some help in understanding.

  20. #20
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    The macro is a one-to-one translation of what I suggested in English previously.
    To handle the Quotation marks:

    Please Login or Register  to view this content.
    Last edited by snb; 11-15-2011 at 12:25 PM.

  21. #21
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    I think we have a problem with this last bit of code. When I run it it seems to delete a large portion of the quotation marks. See the same sample text I pasted earlier as it appears after running this macro:
    Please Login or Register  to view this content.

  22. #22
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    Not too complicated:

    Please Login or Register  to view this content.

  23. #23
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    No, but now we're back where we were. See the sample paragraph now:
    Please Login or Register  to view this content.

  24. #24
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    What does it show ?
    Attached Files Attached Files
    Last edited by snb; 11-15-2011 at 01:31 PM.

  25. #25
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Well, in this section we have back-to-back quotations, which should be separated by a paragrph break.
    Please Login or Register  to view this content.
    In this section we have a paragraph break in the middle of a section bounded by quotation marks (if there is one quotation mark, the next paragraph break shouldn't come until after the next quotation mark, except in rare circumstances).
    Please Login or Register  to view this content.
    Same thing with this section
    Please Login or Register  to view this content.

  26. #26
    Forum Expert snb's Avatar
    Join Date
    05-09-2010
    Location
    VBA
    MS-Off Ver
    Redhat
    Posts
    5,649

    Re: Paragraphing from Text

    Well, in this section we have back-to-back quotations, which should be separated by a paragraph break
    How come, what for ?

    meaning this ?

    She smiled rather wryly. "Not most of the time."
    "A mother like that! A house like this!

    Please reread your initial post.
    Last edited by snb; 11-15-2011 at 01:39 PM.

  27. #27
    Valued Forum Contributor
    Join Date
    12-02-2009
    Location
    Austin, Tx
    MS-Off Ver
    Office 365 64-Bit, 2108, build 14326.21018
    Posts
    4,056

    Re: Paragraphing from Text

    Right. If a person is speaking, their text is surrounded by quotation marks. If they speak five sentences, there's a quotation mark preceding the first sentence, and following the last sentence. So, if there are 2 quotation marks together, it indicates the end of one person speaking, and the beginning of another person speaking. Usual convention is that the speaker's dialogues are separated into distinct paragraphs. See below for how this particular section should be paragraphed:
    Please Login or Register  to view this content.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1