JANET Bandwidth Management Advisory Service
BMAS Home | About BMAS | Bandwidth Management | Technology Papers and Guidance | Product Evaluations | Calls for Partipation
Web Caching
Here you will find a wealth of resources about the subject of caching; an effective method of bandwidth management. Learn about what caching is and why it could be beneficial to your organisation, as well as information on some of the solutions available today.
- Introduction to Web Caching
- Technical Specification
- Caching Solutions
- Caching Resources
- Configure a Browser to Use a Local Cache
- Non-cacheable Objects
- Caching software guides
- Caching FAQs
Introduction to Web Caching
-
The aim of this document is to clarify what caching is and what its advantages are. Please also refer to the "Caching solutions" section of the website where you will find details of caching hardware and software solutions, together with some guides for popular caching software.
What is a web cache?
In Worldwide Web terms, a cache is a place where temporary copies of web objects are kept. Normally when a user accesses the internet, web object requests go directly to the site of origin with the web object then being returned back to the user.
With a local cache in operation, user web object requests go via the local cache which then retains a copy of the said web object. This results in all subsequent requests for the same object being fulfilled from the local cache instead of from the site of origin. This process of web caching minimises the amount of times identical web objects are transferred from remote websites by retaining copies of requested URLs in a cache.
A web cache can be installed utilising both software and / or hardware, and can run on various different platforms. The BMAS website contains details of technical specifications and caching solutions which should provide a wide enough range of information to suit most budgets and areas of expertise.
What are the advantages of creating a local web cache?
With a local cache in operation subsequent requests for previously cached URLs result in the cached copy of the object being returned to the user; creating little or no extra network traffic, improving efficiency and reducing waiting time. This can be particularly effective when you consider the number of requests sent to the same popular websites.
Without a cache all your users requests go directly to the remote site generating web traffic and using up bandwidth.
Top of page
Technical Specification
There are many different solutions available to you if you need to set up your own local cache.
The purpose of this page is to provide you with an outline of what is required to create a local cache. For more detailed information about the majority of caching software packages available (listed by platform) and proprietary hardware available, please see the "Caching Solutions" section below.
We normally recommend that you use a machine specification similar to the one stated below together with software which will run on the operating system with which you are most familiar, especially if you are new to caching. Larger or smaller scale institutions may need to differ the specification.
Please see the "Technical Guides" section for configuration information for the most popular caching software, i.e.Squid (UNIX systems) and MS Proxy Server/Internet Security and Acceleration Server (Windows systems)
Hardware specifications -
700Mhz PIII CPU
6 x Ultra2 LVD SCSI cache disks
512 MB RAM
We recommend that you run your system on about 512MB RAM (certainly no less than 128) with the fastest
SCSI cache disks you can afford.
Operating systems
Caching software is available for the Apple Macintosh, Novell, Sun Solaris, most flavours of UNIX, and Windows operating systems. See "Caching Solutions" for more details.
Top of page
Caching Solutions - software and proprietary hardware
Below we have listed all the main caching software solutions, sorted by platform. Many of these packages are actually web server packages with inherant caching facilities, although some, such as Squid, are dedicated caching packages.
Dedicated Caching software - Unix platform
- Squid - The Squid cache server can be used as a WWW cache or a WWW server "accelerator". Quick Squid configuration notes from BMAS are available.
The FULL Squid configuration guide is available on the Squid support website in HTML (compressed tar file) and PDF formats.
The full Squid Users guide can be found on the Squid website.
Software Solutions:- Windows only
- Avirt Gateway Server (a.k.a. Spaghetti) - Windows 9x/NT/ME/2000
Microsoft Internet Security and Acceleration Server - Windows 2000 Server - Microsoft Proxy Server- MS Proxy
- Viking- Windows NT, 9x, 2000, and XP
- Wingate - Windows 9x, ME, NT SP , 2000, XP
- Winproxy - Windows 9x /NT 4 / 2000/ ME
- Win Route Pro - all Windows OS
- Youngzsoft CCproxy - Windows 98/Me/NT/2000/XP
Software solutions Multi-platform: - Apple Mac, Netware, Solaris, Unix and Windows
- Apache Http server - Apache server has a proxy module which if compiled will act as a caching
- Border Manager - NetWare 5.1 SP 4 or NetWare 6.0 SP1
- IBM WebSphere Edge Server - IBM AIX, Linux, Sun Solaris, Windows 2000 Server
- Intergate- MacOS X 10.1 or above, System Software 7.6.1, 8.x, 9.x, Microsoft Windows 95, 98, NT4 SP3, 2000, Me or XP
- iPlanet Sun ONE Web Proxy Server - Sun Solaris 8/ Solaris 2.6, Windows NT 4 SP6, Windows 2000 Server SP1 and Windows 2000 Advanced Server, H.P HP-UX 11.0 and IBM AIX 4.3.3
- LogiSense engageIP Cache Server - avialable as hardware appliance or as a software only version. Compatible with Microsoft Windows 2000/ NT and Linux.
- Squid - Sun Solaris, most flavours of UNIX, Windows NT. Quick configuration notes from BMAS can be found here.
Top of page
Caching Resources
Below is a list of caching resources which should be useful. The list has links which cover all aspects of caching.
- Caching.com - Excellent resource with lots of information and links about proprietary caching devices. Includes links and information about many of the main commercial caching products such as Microsoft ISA server, Lucent Technologies 'imminet Web Cache' etc
- Caching the web with Linux - David Guerrero. A good all round guide to setting up a cache on Linux
- Cache Now! is a campaign to raise awareness about web caching and what individuals and organisations can do to save time, money & bandwidth for themselves and others.
- Brian Davidsons Web caching and content delivery resources - a wealth of information including a good introduction to caching.
- The DESIRE Web caching architecture recommendations . Very useful - includes a web caching system design checklist.
- IRCache Home Page - details the IRCache project which was formed in 1995, and has many useful resources.
- Joe Coopers' High Performance Web Caching with Squid - article describing how you can tweak Squid to get the best performance.
- O'Reilly Books - Web Caching by Duane Wessels, available to buy online from the O'Reilly website
- Squid Developers' Resource Page - an amazing amount of information about Squid + other links
- The Squid Home page Go here for the latest version of Squid, as well as everything you need to know about installing and configuring Squid caching software.
- Squid Users Guide A useful guide to getting Squid up and running, and pointers as to problems to expect.
- Terena (the Trans-European Research and Education Networking association) - has many caching links which can be found through a searchable front end.
- Web cache.com - A fairly up to date and comprehensive list of caching resources
Top of page
Non-cacheable Objects
-
The Internet provides access to many different types of file - some of these do not lend themselves to being cached. In order to use a local cache effectively, there are some web objects that should not be cached - a list of these current types of objects is shown below.
-
Caching software can be easily configured to route directly to the URL of origin for web objects such as those described below.
-
Nb. There are also some URLs which do not allow caching for various reasons. A list of such sites can be made available in the form of an Access Control List for Squid users.
WEB OBJECT EXAMPLE DESCRIPTION Dynamically generated web objects .php . asp Web objects which are returned from the result of input into a database SSL - a.k.a. https Sites beginning "https" Sites using Secure Socket Layer (SSL - aka https) normally bypass the cache. This is primarily for security reasons, as it is not recommended that sites containing personal data be cached.
If an https site is goes through the cache, the user often sees an error message indicating that the server or dns cannot be found.Real Audio and video Sites that use IP authentication Sites such as electronic journals etc Requests have to go directly to the server of origin to enable authentication. If these sites were cached, your request would arrive via another machine resulting in the IP address being unidentified by the end server. Local Sites or those within the academic domain (.ac.uk) Local and .ac.uk domain sites should not be cached to reduce latency to the request. Cost efficiency issues are not an issue with local sites.
Top of page
Caching software guides
- Microsoft Proxy guide PDF - comprehensive guide to using MS Proxy as a cache server.
- Microsoft ISA Server 2000 Configuration Notes PDF - comprehensive guide to using Microsoft ISA Server 2000 as a cache server.
Top of page
Frequently Asked Questions
- Squid cache configuration FAQ - this page explains some of the problems that you may encounter when setting up your own local cache. NB This page refers to the use of Squid software. For other caching software solutions, please see the 'caching' section and follow relevant links.
- Cachemaster FAQ - this page can help you resolve one the most common student access problems that local cachemasters come across.
Any problems, comments or suggestions regarding this page, please e-mail the BMAS service manager bmas@ja.net