ndelc Posted February 28, 2011 Share Posted February 28, 2011 Does anyone know if there is any easy (automated) way to pull all of the text out of a website and save it as a single (or multiple) text-based file? We'd like to take the text from our website and save it separately to have it translated into different languages, and create support files, etc. Thanks for any help! Link to comment Share on other sites More sharing options...
[dx] Posted February 28, 2011 Share Posted February 28, 2011 Copy/Paste Link to comment Share on other sites More sharing options...
boen_robot Posted February 28, 2011 Share Posted February 28, 2011 If it's on your website, consider separating the translatable content from the common layout. Whether in a database or separate HTML files... the idea is to then stich them up back by using something like PHP.BTW, save the "we" for your clients. In this forum, you are one. Link to comment Share on other sites More sharing options...
jeffman Posted February 28, 2011 Share Posted February 28, 2011 It is actually very easy to isolate the text content of any HTML document from the surrounding structure. If every page is structured in EXACTLY the same way, you could also automate the way individual sections of text are stored.More likely, every page is a little different. Different number of elements, different kinds of elements, different element ID's, and so on. If that is the case, it will be very difficult, and maybe impossible, for an automated system to examine a document, pull out sections of text, and store them in a way that makes sense. Link to comment Share on other sites More sharing options...
cousineaug Posted March 1, 2011 Share Posted March 1, 2011 Does anyone know if there is any easy (automated) way to pull all of the text out of a website and save it as a single (or multiple) text-based file? We'd like to take the text from our website and save it separately to have it translated into different languages, and create support files, etc. Thanks for any help!What HTML Editor do you use?Does it have a function similar to STRIP HTML?Mine has that function and it works very well. Link to comment Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.