Jump to content

snoopy


skym

Recommended Posts

Anybody familiar with Snoopy? http://snoopy.sourceforge.net/I'm using Snoopy v1.2.3 for a month and have recently come to a problem with a particular site. When trying to fetch http://laptops.lookup.nl/, all I get is the first few rows, just exactly a javascript code that is in the beginning of the page. At first I have thought the problem might be about code (Urchin, Google Analytics), but a few other subdomains work very well with the same JS code at the beginning.Then realized that the fetch result I get is the source you get when accessing http://lookup.nl/ (the page will show blank, but if you check the source you can see the Urchin code). http://lookup.nl/ is fetched instead of http://laptops.lookup.nl/? The problem persists with http://laptops.lookup.nl/ and other subdomains, and everything works with some subdomains (like http://computeronderwerpen.lookup.nl/) and with http://www.lookup.nl/.ALL pages are read correctly when using fopen(), file_get_contents(), fsockopen() or file(), used separately in a testing script. Snoopy is using fsockopen().Thanks

Link to comment
Share on other sites

Hmm, looks like that Snoopy appends the port to the host in the headers he is sending: GET / HTTP/1.0 User-Agent: Snoopy v1.2.3 Host: laptops.lookup.nl:80 Accept: image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, */* Changing Host: laptops.lookup.nl:80 to Host: laptops.lookup.nl seems to solve the problem.

Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...