Unix Technical Forum

problem with wget

This is a discussion on "problem with wget" within the Slackware Linux Support forums, part of the Unix Operating Systems category.


#1
02-18-2008, 11:02 AM
mrlinux
problem with wget

Hi all,

I have to mirror an http site using wget but I got a file not found
error, and I think this is due to a firewall.

I used the command:
wget -cr http:/.......

Can you suggest another program to mirror this site?


Thank you
#2
02-18-2008, 11:02 AM
Simon
Re: problem with wget

On 4 Sep 2003 10:26:57 -0700, mrlinux <mr_linux@tiscali.it> wrote:
> I have to mirror an http site using wget
>
> wget -cr http:/.......


You might like to look at the '-m' option (instead of '-r') in the wget
man page.
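
For reference, a minimal sketch of what that might look like. The wget man page describes -m (--mirror) as shorthand for -r -N -l inf --no-remove-listing. The extra -np, -k and -p flags, the function name and the example.com URL are assumptions for illustration; the URL in the original command was elided.

```shell
# Sketch only: the function name and URL are hypothetical.
# -m   --mirror: same as -r -N -l inf --no-remove-listing
# -np  --no-parent: never ascend above the starting directory
# -k   --convert-links: rewrite links so the copy browses locally
# -p   --page-requisites: also fetch images, CSS and similar files
mirror_site() {
    wget -m -np -k -p "$1"
}

# Example call (commented out so nothing is fetched by accident):
# mirror_site "http://example.com/docs/"
```

The -N part of -m turns on time-stamping, so a second run should only re-fetch files that changed on the server.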


--
Simon <simon@no-dns-yet.org.uk> **** GPG: F4A23C69
"We demand rigidly defined areas of doubt and uncertainty."
- Douglas Adams

#3
02-18-2008, 11:02 AM
Mark Hill
Re: problem with wget

On 2003-09-04,
mrlinux <mr_linux@tiscali.it> wrote:
>
> I have to mirror an http site using wget but I got a file not found
> error, and I think this is due to a firewall.


What's the exact error you're getting?
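
One way to capture it is to ask wget for the server's response headers without downloading anything. This is a sketch with a made-up function name and a placeholder URL; -S (--server-response) and --spider are standard wget options. If the status line shows 404, the request did reach the web server, which would point away from a firewall.

```shell
# Sketch only: the function name and URL are hypothetical.
# --spider  check that the page exists, download nothing
# -S        print the HTTP headers sent by the server
# wget writes this report to stderr, so merge it into stdout
# and keep only the HTTP status lines.
show_http_status() {
    wget --spider -S "$1" 2>&1 | grep 'HTTP/'
}

# show_http_status "http://example.com/docs/"
```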

If it turns out to be a robots.txt issue, then IIRC you'd have to
recompile wget for it not to honour this file.
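
As far as I know, a recompile should not be needed for this: wget accepts wgetrc commands on the command line through -e, and robots=off is one of them. A minimal sketch, with a hypothetical function name and a placeholder URL:

```shell
# Sketch only: the function name and URL are hypothetical.
# -e robots=off  run the wgetrc command "robots = off", so wget
#                does not obey robots.txt for this run
mirror_ignoring_robots() {
    wget -e robots=off -m -np "$1"
}

# mirror_ignoring_robots "http://example.com/docs/"
```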

> Can you suggest another program to mirror this site?


A few people like a program called mirror. I've heard it's good.
I *think* this is the correct link to the project home page.

<URL:http://sunsite.org.uk/packages/mirror/>


--
Mark Hill <mark_usenet@yahoo.co.uk>
"Remind me not to fix mtrr.c after half a litre of wine in future."
- Alan Cox
#4
02-18-2008, 11:02 AM
David
Re: problem with wget

mrlinux wrote:
> Hi all,
>
> I have to mirror an http site using wget but I got a file not found
> error, and I think this is due to a firewall.
>
> I used the command:
> wget -cr http:/.......
>
> Can you suggest another program to mirror this site?


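If you want to mirror a site, you might take a look at "rsync".

# rsync -av --delete ftp.server-name.com:path/to/directory /path/to/local/copy

(/path/to/local/copy is a placeholder for the local destination; without
a destination, rsync only lists the remote files.)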

--
Confucius: He who play in root, eventually kill tree.
Registered with The Linux Counter. http://counter.li.org/
Slackware 9.0 Kernel 2.4.22 i686 (GCC) 3.3
Uptime: 5 days, 15:49, 1 user, load average: 1.55, 1.42, 1.26

#5
02-18-2008, 11:02 AM
C. Newport
Re: problem with wget

On Thursday 04 September 2003 6:47 pm in alt.os.linux.slackware Mark Hill
wrote:

> On 2003-09-04,
> mrlinux <mr_linux@tiscali.it> wrote:
>>
>> I have to mirror an http site using wget but I got a file not found
>> error, and I think this is due to a firewall.
>
> What's the exact error you're getting?
>
> If it turns out to be a robots.txt issue, then IIRC you'd have to
> recompile wget for it not to honour this file.
>
>> Can you suggest another program to mirror this site?
>
> A few people like a program called mirror. I've heard it's good.
> I *think* this is the correct link to the project home page.
>
> <URL:http://sunsite.org.uk/packages/mirror/>


BTDT - mirror uses ftp so it is not as efficient as rsync.
rsync is fast, works well and is easy to set up, but it does need the
site that you are mirroring to be running an rsync server.
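
A quick way to check for that, sketched under assumptions: asking rsync for the module list of a host succeeds only if an rsync daemon answers there (port 873 by default). The host name is a placeholder and the function name is made up for the example; --contimeout needs a reasonably recent rsync.

```shell
# Sketch only: the function name and host are hypothetical.
# "rsync rsync://HOST/" with no module name asks the daemon to
# list its modules; the command fails if no rsync daemon answers.
has_rsync_server() {
    rsync --contimeout=10 "rsync://$1/" >/dev/null 2>&1
}

# if has_rsync_server "ftp.example.com"; then
#     echo "rsync daemon found"
# else
#     echo "no rsync daemon, fall back to wget or HTTrack"
# fi
```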
#6
02-18-2008, 11:02 AM
Mark Hill
Re: problem with wget

On 2003-09-04,
C. Newport <crn@NOSPAM.netunix.com> wrote:

> BTDT - mirror uses ftp so it is not as efficient as rsync.
> rsync is fast, works well and is easy to set up, but it does need the
> site that you are mirroring to be running an rsync server.


I just realised that the OP was trying to mirror over http, so perhaps
neither of these options is available to him.

Yes, rsync looks *much* better. I'm only just starting to get into it.


--
Mark Hill <mark_usenet@yahoo.co.uk>
#7
02-18-2008, 11:03 AM
Billy Watt
Re: problem with wget

In news:3de60166.0309040926.4af6dc90@posting.google.com,
mrlinux <mr_linux@tiscali.it> rambled:

>
> I have to mirror an http site using wget but I got a file not found
> error, and I think this is due to a firewall.
>
> I used the command:
> wget -cr http:/.......
>
> Can you suggest another program to mirror this site?
>


You could try HTTrack.
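
For anyone who has not used it, HTTrack has a command-line front end called httrack. A minimal sketch, assuming a placeholder URL and output directory and a made-up function name; -O sets where the mirror and its logs are written.

```shell
# Sketch only: the function name, URL and directory are hypothetical.
# -O DIR  store the mirrored files and HTTrack's logs under DIR
httrack_mirror() {
    httrack "$1" -O "$2"
}

# httrack_mirror "http://example.com/docs/" "$HOME/mirror/example"
```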

Billy


#8
02-18-2008, 11:03 AM
Jurgen Philippaerts
Re: problem with wget

On Thu, Sep 04, 2003 at 10:26:57AM -0700, mrlinux wrote:

> I have to mirror an http site using wget but I got a file not found
> error, and I think this is due to a firewall.

> Can you suggest another program to mirror this site?

Ask the people who are running the site that you *HAVE* to mirror what
they recommend.

Jurgen.
All times are GMT.

