Jump to content

Easy way to pull text from a site?


ndelc

Recommended Posts

Does anyone know if there is any easy (automated) way to pull all of the text out of a website and save it as a single (or multiple) text-based file? We'd like to take the text from our website and save it separately to have it translated into different languages, and create support files, etc. Thanks for any help!

Link to comment
Share on other sites

If it's on your website, consider separating the translatable content from the common layout. Whether in a database or separate HTML files... the idea is to then stich them up back by using something like PHP.BTW, save the "we" for your clients. In this forum, you are one.

Link to comment
Share on other sites

It is actually very easy to isolate the text content of any HTML document from the surrounding structure. If every page is structured in EXACTLY the same way, you could also automate the way individual sections of text are stored.More likely, every page is a little different. Different number of elements, different kinds of elements, different element ID's, and so on. If that is the case, it will be very difficult, and maybe impossible, for an automated system to examine a document, pull out sections of text, and store them in a way that makes sense.

Link to comment
Share on other sites

Does anyone know if there is any easy (automated) way to pull all of the text out of a website and save it as a single (or multiple) text-based file? We'd like to take the text from our website and save it separately to have it translated into different languages, and create support files, etc. Thanks for any help!
What HTML Editor do you use?Does it have a function similar to STRIP HTML?Mine has that function and it works very well.
Link to comment
Share on other sites

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...