
How to parse a Word document



How can we parse a Word document using ASP.NET and C#?

My question is about parsing a Word document, say a resume, to read contents such as the name and email address. I have an idea, but I am not sure how to implement it. Can someone help me with this? My idea is:

- Take the first line. If it has two or more words (other than "curriculum vitae") separated by a single space, or by a period (.) followed by a space, it is a name.

- Take the second line. If it has two or more words separated by a single space, or by a period (.) followed by a space, it is a name.

- Consider the last line. If it has two or more words separated by a single space, or by a period (.) followed by a space, it is a name.

- Consider the second-to-last line. If it has two or more words separated by a single space, or by a period (.) followed by a space, it is a name.
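
The four checks above could be sketched roughly like this. It is written in Python only to keep it short; the same regular expression should carry over to C#'s Regex class with little change. The names `looks_like_name`, `guess_name` and `HEADING_WORDS` are made up for illustration, and the extra heading words ("resume", "cv") are my assumption, not part of the original idea. It also assumes the document text has already been extracted into a list of lines.

```python
import re

# Words that mark a heading rather than a name.
# Only "curriculum vitae" comes from the idea above; the rest are assumptions.
HEADING_WORDS = {"curriculum", "vitae", "resume", "cv"}

# One word: letters (plus ' or -), optionally ending in a period, e.g. "J."
WORD = r"[A-Za-z][A-Za-z'-]*\.?"
# Two or more words, each separated by exactly one space.
NAME_LINE = re.compile(rf"{WORD}(?: {WORD})+")


def looks_like_name(line):
    """Return True if the line is 2+ words split by ' ' or '. ' and is not a heading."""
    text = line.strip()
    if not NAME_LINE.fullmatch(text):
        return False
    words = {w.strip(".").lower() for w in text.split(" ")}
    return not (words & HEADING_WORDS)


def guess_name(lines):
    """Check the first, second, last and second-to-last non-empty lines, in that order."""
    non_empty = [ln for ln in lines if ln.strip()]
    for index in (0, 1, -1, -2):
        # Skip positions that do not exist in very short documents.
        if -len(non_empty) <= index < len(non_empty):
            if looks_like_name(non_empty[index]):
                return non_empty[index].strip()
    return None


resume = ["Curriculum Vitae", "John A. Smith", "john@example.com"]
print(guess_name(resume))
```

Note that a rule this loose also accepts any short line of plain words (for example a closing sentence on the last line), so the result is a candidate to verify rather than a confirmed name.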

 

I know how to read the telephone number and email address using patterns, but I don't know how to implement finding the name. Any ideas?
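
For completeness, the pattern-based part for telephone number and email might look something like the sketch below. The two expressions are deliberately loose assumptions, not a validated standard: `EMAIL`, `PHONE` and `find_contacts` are hypothetical names, and the phone pattern simply accepts a run of at least eight digits and common separators on one line. It is in Python for brevity; the patterns themselves are plain regular expressions.

```python
import re

# Loose email pattern: something@domain.tld (an assumption, not a full RFC check).
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Loose phone pattern: optional '+', then digits mixed with spaces, dots,
# hyphens or parentheses, starting and ending with a digit.
PHONE = re.compile(r"\+?\d[\d ().-]{6,}\d")


def find_contacts(text):
    """Return every email-like and phone-like substring found in the text."""
    return {
        "emails": EMAIL.findall(text),
        "phones": PHONE.findall(text),
    }


sample = "John Smith\nEmail: john.smith@example.com\nTel: +1 (555) 123-4567\n"
print(find_contacts(sample))
```

Real resumes vary a lot in how phone numbers are written, so the phone pattern would likely need tuning for the formats you actually receive.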
