Can I find the size of an entire website?

sofasurfer

Member
Joined
May 24, 2022
Messages
49
Reaction score
16
Credits
410
I'm using Ubuntu 20.04
Is there a way, preferably from the command line, to find the total size of a website?
 


KGIII

Super Moderator
Staff member
Gold Supporter
Joined
Jul 23, 2020
Messages
8,625
Reaction score
7,368
Credits
70,127
Not realistically, no...

Unless, of course, you're hosting the website. If you're hosting the site, just use 'ncdu' on the appropriate folder.

If you do not host the site, then no... No, you can probably use httrack/wget to download all the pages (considered kinda rude), but that'd be just the pages and images and stuff like that... There's often still a lot you won't see and download, like the database that populates all that.

So, no... That's not something you can realistically do.
 

Giesbert

New Member
Joined
Sep 2, 2022
Messages
3
Reaction score
3
Credits
18
If this is your own website running on your own server:
Code:
du -hs /var/www/nextcloud/
 

forester

Well-Known Member
Joined
Mar 5, 2022
Messages
544
Reaction score
315
Credits
3,829
Puppy Linux has a software --
Pfetch allows downloading an entire site, from which should allow one be able to determine size.
Another link from cyberciti
 

KGIII

Super Moderator
Staff member
Gold Supporter
Joined
Jul 23, 2020
Messages
8,625
Reaction score
7,368
Credits
70,127
Pfetch allows downloading an entire site,

That is only true if the site is pure HTML type stuff. You won't get things like the backend stuff - databases and stuff like that. My linux-tips site is about 10 GB in size, with probably about 1/20 of that being accessible publicly with tools like Pfetch or HTTrack (or wget, which you can set to no-clobber at least).
 

forester

Well-Known Member
Joined
Mar 5, 2022
Messages
544
Reaction score
315
Credits
3,829
Thanks for clarification.
 

Rob

Administrator
Staff member
Joined
Oct 27, 2011
Messages
1,069
Reaction score
2,020
Credits
2,417
Last edited:

f33dm3bits

Gold Member
Gold Supporter
Joined
Dec 11, 2019
Messages
5,591
Reaction score
4,157
Credits
40,812
There is a high chance that it is in the default repos of the distribution you run as well.
Code:
httrack.x86_64 : Website copier and offline browser
 
MALIBAL Linux Laptops

Linux Laptops Custom Built for You
MALIBAL is an innovative computer manufacturer that produces high-performance, custom laptops for Linux.

For more info, visit: https://www.malibal.com

Staff online


Top