--- linklint-2.3.5.orig/debian/changelog +++ linklint-2.3.5/debian/changelog @@ -0,0 +1,78 @@ +linklint (2.3.5-5.2) unstable; urgency=medium + + * Non maintainer upload by the Reproducible Builds team. + * No source change upload to rebuild on buildd with .buildinfo files. + + -- Holger Levsen Tue, 29 Dec 2020 01:08:01 +0100 + +linklint (2.3.5-5.1) unstable; urgency=medium + + * Non-maintainer upload. + * Move to debhelper compat 9. (Closes: #817566) + + -- Chris Lamb Sun, 25 Sep 2016 13:56:40 +0200 + +linklint (2.3.5-5) unstable; urgency=low + + * Remove me from Maintainers. + + -- Luk Claes Wed, 23 Jan 2008 22:50:00 +0000 + +linklint (2.3.5-4) unstable; urgency=low + + * New maintainers. + * Bumped Standards-Version to 3.7.2. + * Add debian/compat for debhelper 4. + + -- Luk Claes Sat, 30 Sep 2006 12:07:06 +0200 + +linklint (2.3.5-3) unstable; urgency=low + + * Fix misspelled LinkLint on output. Bug report and patch by + Justin B Rye, thanks. Closes: #237945 + * The -checksum option was documented but disabled, it is now + enabled. Bug report and patch by Justin B Rye, thanks. + Closes: #238054 + * Fix typo in -help message, s/Exaustive/Exhaustive/ + + -- Denis Barbier Sun, 21 Mar 2004 21:13:03 +0100 + +linklint (2.3.5-2) unstable; urgency=low + + * Add .png extension to the list of image format. + (Closes: Bug#115091) Thanks Jürgen A. Erhard + * Move to Architecture: all since linklint is a Perl script + * Bump Standards-Version: 3.6.1 and recode this changelog file + into UTF-8. + * Update current homepage in debian/copyright. + + -- Denis Barbier Thu, 9 Oct 2003 23:56:30 +0200 + +linklint (2.3.5-1) unstable; urgency=low + + * New upstream release + + -- Denis Barbier Mon, 17 Sep 2001 23:16:41 +0200 + +linklint (2.3.4-2) unstable; urgency=low + + * Install missing dot.gif and linklint.gif in + /usr/share/doc/linklint/html/ (Closes: Bug#144198) + + -- Denis Barbier Tue, 23 Apr 2002 23:14:10 +0200 + +linklint (2.3.4-1) unstable; urgency=low + + * New upstream release + * Improve documentation (Closes: Bug#105602, Bug#105698) + * Fix Description in control file + + -- Denis Barbier Tue, 17 Jul 2001 23:08:07 +0200 + +linklint (2.3.3-1) unstable; urgency=low + + * Initial Release (Closes: Bug#103957) + * Within URL paths, // are replaced by / to prevent infinite recursion. + + -- Denis Barbier Thu, 12 Jul 2001 22:55:51 +0200 + --- linklint-2.3.5.orig/debian/compat +++ linklint-2.3.5/debian/compat @@ -0,0 +1 @@ +9 --- linklint-2.3.5.orig/debian/control +++ linklint-2.3.5/debian/control @@ -0,0 +1,22 @@ +Source: linklint +Section: web +Priority: optional +Maintainer: Felipe Augusto van de Wiel (faw) +Standards-Version: 3.7.2 +Build-Depends: debhelper (>= 9) +Build-Depends-Indep: docbook-to-man + +Package: linklint +Architecture: all +Description: A fast link checker and web site maintenance tool + Linklint is a full featured Perl program for checking HTML links: + * both local and remote site checking + * cross referenced and fully hyperlinked output reports + * the ability to check password protected areas + * support for all standard server-side image maps + * reports of orphan files, and files with mismatching case + * a report of which URLs have changed since last checked + * support of proxy servers for remote URL checking + * come with full documentation. + . + Please visit http://www.linklint.org/ for more information. --- linklint-2.3.5.orig/debian/copyright +++ linklint-2.3.5/debian/copyright @@ -0,0 +1,17 @@ +This package was debianized by Denis Barbier on +Fri Oct 10 00:19:14 CEST 2003 + +Currently maintained by Luk Claes and + Felipe Augusto van de Wiel + +It was downloaded from + http://www.linklint.org/download/linklint-2.3.5.tar.gz + +No specific changes were needed. + +Upstream Author: James B. Bowlin + +James B. Bowlin released this version under the GNU GPL. + +On Debian GNU/Linux systems, the complete text of the GNU General +Public License may be found in `/usr/share/common-licenses/GPL'. --- linklint-2.3.5.orig/debian/dirs +++ linklint-2.3.5/debian/dirs @@ -0,0 +1,3 @@ +usr/bin +usr/share/man/man1 +usr/share/doc/linklint/html --- linklint-2.3.5.orig/debian/doc-base.package +++ linklint-2.3.5/debian/doc-base.package @@ -0,0 +1,15 @@ +Document: linklint +Title: Debian linklint Manual +Author: James B. Bowlin +Abstract: Linklint is a full featured Perl program for checking HTML links. + It supports both local and remote site checking, cross referenced and + fully hyperlinked output reports, the ability to check password + protected areas, and many other features. Furthermore output is very + detailed. This manual describes how to use linklint and read generated + reports. +Section: web + +Format: HTML +Index: /usr/share/doc/linklint/html/index.html +Files: /usr/share/doc/linklint/html/*.html + --- linklint-2.3.5.orig/debian/linklint.pod +++ linklint-2.3.5/debian/linklint.pod @@ -0,0 +1,615 @@ + +=head1 NAME + +Linklint - fast link checker and website maintenance tool + +=head1 SYNOPSIS + +B +[B<-cache I>] +[B<-case>] +[B<-checksum>] +[B<-concise_url>] +[B<-db1..9>] +[B<-delay I>] +[B<-doc>] +[B<-docbase I>] +[B<-dont_output I>] +[B<-error>] +[B<-flush>] +[B<-forward>] +[B<-help>] +[B<-help_all>] +[B<-host I>] +[B<-host I>] +[B<-htmlonly>] +[B<-http>] +[B<-http_header I>] +[B<-ignore I>] +[B<-index I>] +[B<-language I>] +[B<-limit I>] +[B<-list>] +[B<-local I>] +[B<-map I>] +[B<-net>] +[B<-netmod>] +[B<-netset>] +[B<-no_anchors>] +[B<-no_query_string>] +[B<-no_warn_index>] +[B<-orphan>] +[B<-out I>] +[B<-output_frames>] +[B<-output_index I>] +[B<-password I>] +[B<-proxy I>] +[B<-quiet>] +[B<-redirect>] +[B<-retry>] +[B<-silent>] +[B<-skip I>] +[B<-textonly>] +[B<-timeout I>] +[B<-url_doc_prefix I>] +[B<-version>] +[B<-warn>] +[B<-xref>] +I + +=head1 VERSION + +2.3.5 August 13, 2001 + +=head1 DESCRIPTION + +This manual page documents briefly the Linklint program, which is an +Open Source Perl program that checks local and remote HTML links. + +This manual page was written for the Debian distribution because the +original program does not have a manual page. Instead, it has +documentation in the HTML format; see below. + +=head1 OPTIONS + +=head2 Input File Selection + +Whether you are doing a local site check or an HTTP site check, you +specify which directories (presumably containing HTML files) to check +with one or more linksets. A linkset uses two wildcard characters +@ and #. +Each linkset specifies one or more directories much like the standard +* and ? wildcard characters are used to specify the characters in the +* names of files in one directory. + +The @ character matches any string of characters (this kind of acts like +"*"), and the # character (which is kind of like "?") matches any string +of characters except "/" . The best way to understand how @ and # work +is to look at a few examples: + + the entire site /@ + the homepage only (default) / + files in the root directory only /# + . . . and one directory down /#/# + files in the sub directory only /sub/# + files in the sub directory and below /sub/@ + specific files /file1 /file2 ... + specific subdirectories /sub1/@ /sub2/@ ... + +If you specify more than one linkset, files matching any of the linksets +will be checked. HTML files that don't match any of the linksets will be +skipped. Linklint will see if they exist but won't check any of their +links. + +=head2 Other File Selection Options + +=over 4 + +=item B<-skip> I + +Skips HTML files that match I. C will make sure +these files exist but won't add any of their links to the list of files +to check. Multiple I are allowed, but each must be preceded +with B<-skip> on the command line. Skipsets use the same wildcard +characters as linksets. + +=item B<-ignore> I + +Ignores files matching I. C doesn't even check to +see if these files exist. Multiple I are allowed, but each +must be preceded with B<-ignore> on the command line. Ignoresets use +the same wildcard characters as linksets. + +=item B<-limit> I + +Limits checking to I HTML files (default 500). All HTML files after +the first I are skipped. + +=back + +=head2 Local Site Checking + +If you are developing HTML pages on a computer that does not have an +http server, or if you are developing a simple site that does not use +Server Redirection or extensive CGI, you should use local site checking. + + linklint /@ + +Checks all HTML files in the current directory and below. Assumes that +the current directory is the server root directory so links starting +with "/" default to this directory. You must specify B to check the +entire site. See Which Files to Check for details. + + linklint -root dir /@ + +Checks all HTML files in dir and below. This is useful if you want to +check several sites on the same machine or if you don't want to run +Linklint in your public HTML directory. + +=head2 Other Local Site Options + +=over 4 + +=item B<-host> I + +By default C assumes all links on your site that start with +C are remote links to other sites. If you have absolute links +to your own site, give C your hostname and links starting with +C will be treated as local files. If you specify +B<-host hostname:port,> only http links to this hostname and port will +be treated as local files. + +=item B<-case> + +Makes sure that the filename (upper/lower) case used links inside of +html tags matches the case used by the file system. This is for +Windows only and is very handy if you are porting a site to a Unix +host. + +=item B<-orphan> + +Checks all directories that contain files used on the site for unused +(orphan) files. + +=item B<-index> I + +Uses I as the default index file instead of the default list used +by C. You can specify more than one file but each one must be +preceded by B<-index> on the command line. If a default index file is +not found, C uses a listing of the entire directory. See the +Default File section for details. + +=item B<-map> I + +Substitutes leading I with I. For server-side image maps or to +simulate Server Redirection. + +=item B<-no_warn_index> + +Turns of the "index file not found" warning. Applies to local site +checking only. + +=item B<-no_anchors> + +Tells C to ignore named anchors. This could ease memory +problems for people with large sites who are primarily interested in +missing pages and not missing named anchors. This option works for +both HTTP and local site checks. + +=back + +=head2 HTTP Site Checking + +If you have a complicated site that uses lots of CGI or Server +Redirection, you should use HTTP site checking. Even though an HTTP +site check reads pages via your HTTP server, you will get the best +performance if you do your checking on a machine that has a high speed +connection to your server. + + linklint -http -host www.site.com /@ + +The B<-http> flag tells C to check HTML files on the site +www.site.com via a remote http connection. You must specify a -host +whenever you do an HTTP site check (otherwise Linklint won't where to +get your pages). You can specify B to check the entire site. See Which +Files to Check for details. + +=head2 HTTP Site Check Options + +=over 4 + +=item B<-http> + +This flag tells Linklint to perform an HTTP site check instead of a +local site check. All files (except server side image maps) will be +read via the HTTP protocol from your web server. + +=item B<-host> I + +If you include I<:port> at the end of your +hostname, Linklint uses this port for the HTTP site check. + +=item B<-password> I + +Uses I and I as authorization to enter password +protected I Realms are named areas of a site that share a common +set of usernames and passwords. If passwords are needed to check your +site, Linklint will tell you which realms need passwords in warning +messages. Enclose the realm in double quotes if it contains spaces. If +no password is given for a specific realm, Linklint will try using the +password for the "C" realm if it was provided. + +=item B<-timeout> I + +Times out after I seconds (default 15) when +getting files via http. Once data is received, an additional +I seconds is allowed. The timeout is +disabled on Windows machines since the Windows port of Perl does not +support the C function. + +=item B<-delay> I + +Delays I seconds between requests to the same +host (default 0). This is a friendly thing to do especially if you are +checking many links on the same host. + +=item B<-local> I + +Gets files that match I locally. The default B<-local> +I is B<@.map> (which matches any link ending in F<.map>). This +allows Linklint to follow links through server-side image maps. The +default is ignored if you specify your own B<-local> expressions. You +need to specify the B<-root> directory for this option to work propery. + +=item B<-map> I + +Substitutes leading I with I. For server-side image maps or to +simulate Server Redirection. + +=item B<-no_anchors> + +Tells C to ignore named anchors. + +=item B<-no_query_string> + +Up until version 2.3.4, Linklint did not use query strings while doing +HTTP site checks. Query strings were removed before making HTTP +requests. As of 2.3.4 query strings in links are used in the +requests. Use the B<-no_query_string> flag to get back the "old" +behavior. + +=item B<-http_header> I + +Adds the HTTP header I to all HTTP requests +generated by Linklint. You will need to use quotation marks to hide +spaces in the header line from the command line interpreter. Linklint +will automatically add a space after the first colon if there is not +one there already. Multiple (unique) header lines are allowed. + +=item B<-language> I + +This option is only useful if you are checking a site that uses +content negotiation to present the same URL in different languages. + +Creates an HTTP Request header of the form +B that is included as part of all HTTP +requests generated by Linklint. Multiple B<-language> +specifications are allowed. This will result in a single +B header that lists all of the languages you +have specified in alphabetical order. Some web sites can use this +information to return pages to you in a specific language. + +If you need to get more complicated than this, use the more general +purpose B<-http_header> to create your own header. There is a partial +list of language abbreviations (taken from Debian) included as part of +the Linklint documentation. + +=back + +=head2 Remote URL Checking + +A remote URL check is used to see if a remote URL exists (or has been +recently modified). Links in the remote pages are not checked nor does +Linklint look for named anchors in remote URLs. + +Remote URL checking can be used to check all of the "remote" links on +your site (those that link to pages on other sites) or it can check a +list of URLs. There are several ways to specify which remote URLs to +check: + + linklint http://somehost/file.html + +Checks to see if F exists on somehost. Multiple URLs can be +entered on the command line, in an I<@commandfile>, or in an +I<@@httpfile>. Every URL to be checked must begin with C. This +will disable site checking. + + linklint @@httpfile + +Checks all the remote http URLs found in httpfile. Anything in the file +starting with C is considered to be a URL. If the file looks +like a F file generated by Linklint then all failed URLs will +be cross referenced. + + linklint @@ -doc linkdoc + +Assuming you have already done a site check and used B<-doc linkdoc> to +put all of your output files in the linkdoc directory, Linklint will +check all the remote links that were found on your site and cross +reference all failed URLs without doing a site check. You can use the +B<-netmod> or B<-netset> flags to enable the status-cache. + + linklint -net [site check options] + +The B<-net> flag tells Linklint to check all remote links after doing +either a local or HTTP site check site. If you are having memory +problems, don't use the B<-net> option, instead use one of the B<@@> +options above. + +=head2 Other Remote URL Options + +=over 4 + +=item B<-timeout> I + +Times out after I seconds (default 15) when getting files via http. +Once data is received, an additional I seconds is allowed. The +timeout is disabled on Windows machines since the Windows port of Perl +does not support the C function. + +=item B<-delay> I + +Delays I seconds between requests to the same host (default 0). This +is a friendly thing to do especially if you are checking many links on +the same host. + +=item B<-redirect> + +Checks for EmetaE redirects in the headers of remote URLs that +are html files. If a redirect is found it is followed. This feature is +disabled if the status cache is used. + +=item B<-proxy> I + +Sends all remote HTTP requests through the proxy server I and +the optional I. This allows you to check remote URLs or (new with +version 2.3.1) your entire site from within a firewall that has an http +proxy server. Some error messages (relating to host errors) may not be +available through a proxy server. + +=item B<-concise_url> + +Turns off printing successful URLs to STDOUT during remote link +checking. + +=back + +=head2 Status Cache Options + +The Status Cache is a very powerful feature. It allows you to keep track +of recent changes in all of the remote (off-site) pages you link to. You +can then use the Linklint output files to quickly check changed pages to +see if they still meet your needs. + +The flags below make use of the status cache file linklint.url (kept in +your HOME or LINKLINT directory). This file keeps track of the +modification dates of all the remote URLs that you check. + +=over 4 + +=item B<-netmod> + +Operates just like B<-net> but makes use of the status cache. Newly +checked URLs will be entered in the cache. Linklint will tell you which +(previously cached) URLs have been modified since the last B<-netset>. + +=item B<-netset> + +Like B<-netmod> but also resets the last modified status in the cache +for all URLs that checked ok. If you always use B<-netset>, modified +URLs will be reported just once. + +=item B<-retry> + +Only checks URLs that have a host fail status in the cache. Sometimes a +URL fails because its host is temporarily down. This flag enables you +to recheck just those links. An easy way to recheck all the cached URLs +with host failures is C. Use +C if you want failed URLs to be +cross referenced. + +=item B<-flush> + +Removes all URLs from the cache that are not currently being checked. +The B<-retry> flag has no effect on which URLs are flushed. + +=item B<-checksum> + +Ensures that every URL that has been modified is reported as such. This +flag can make the remote checking take longer. Many of the pages that +require a checksum are dynamically generated and will always be reported +as modified. + +=item B<-cache> I + +Reads and writes the F cache file in this directory. The +default directory is set by your LINKLINT or HOME environment variables. + +=back + +=head2 Output Options + +No output files are generated by default, only progress and a brief +summary of the results are printed to the screen. You can produce +complete documentation (split up into separate files) in a B<-doc> +directory or put selected output in a single B<-out> file or by +redirecting the standard output to a file. See the Output File +Specification section for a detailed description of all output files. + +=head2 Multi File Output + +=over 4 + +=item B<-doc> I + +Sends all output to the I directory. The output is divided +into separate F<.txt> and F<.html> files. Complete documentation is +always produced regardless of the single file flags. + +The file F contains an index to all the other files; +F is an HTML version of the index. The index files for +remote URL checking are F and F. + +=item B<-textonly> + +Prevents any HTML files from being created in the B<-doc> directory. + +=item B<-htmlonly> + +Erases redundant text files in the B<-doc> directory after they have +been used to create the HTML output files. The files F and +F are not erased since they can be used by Linklint to +recheck remote URLs. + +=item B<-docbase> I + +Overrides the default I expression used for directing a browser to +the resources listed in the output HTML files. The base is prepended to +local links in the output HTML files. This only affects the links in +HTML output files, it has no effect on what is displayed in these files. +Ordinarily this flag would only be used during a local site check to set +the base to C. + +=item B<-output_frames> + +All HTML output data files are linked to from F. If you use +this flag then the the data files will be opened up in a new frame +(window) which can be handy in some cases since it always leaves the +F file open in its own window. + +=item B<-output_index> I + +The output index files were previously named F and +F. These have now been changed to F and +F. You can use the B<-output_index> option to change this +name back to C or to something else. + +=item B<-url_doc_prefix> I + +By default, the output files associate with remote URL checking all +start with "url". You can change this with the B<-url_doc_prefix> +option. If the url_doc_prefix contains a "/" character then the +appropriate directory will be created (as a subdirectory of the -doc +directory). + +=item B<-dont_output> I + +Don't create output files that contain "xxxx". Can be repeated. +Example: + + -dont_output "X$" + +will supress the output of all cross reference files. + +=back + +=head2 Single File Output + +=over 4 + +=item B<-error> + +Lists missing files and other errors. + +=item B<-out> I + +Sends list output and summary information to I. + +=item B<-list> + +Lists all found files, links, directories etc. + +=item B<-warn> + +Lists all warnings. + +=item B<-xref> + +Adds cross references to the lists. + +=item B<-forward> + +Sorts lists by referring file. + +=back + +=head2 Debug and other Flags + +=over 4 + +=item B<-db1> + +Debugs command line input and linkset expressions. + +=item B<-db2> + +Prints the name of every file that gets checked (not just HTML files). + +=item B<-db3> + +Debugs HTML parser, prints out tags and resulting links. + +=item B<-db4> + +Debugs socket connection (kind of). + +=item B<-db5> + +Not used. + +=item B<-db6> + +Details last-modified status for remote URLs (requires B<-netset> or +B<-netmod>). + +=item B<-db7> + +Prints brief debug information while checking remote URLs. + +=item B<-db8> + +Prints all http headers while checking remote URLs. + +=item B<-db9> + +Generates random http errors. + +=item B<-version> + +Gives version information. + +=item B<-help> + +Lists a few simple examples of how to use Linklint. + +=item B<-help_all> + +Lists all help (contained in program) including every input option. + +=item B<-quiet> + +Disables printing progress to the screen. + +=item B<-silent> + +Disables printing summarys to the screen. + +=back + +=head1 AUTHOR + +Linklint is written by James B. Bowlin Ejbowlin@linklint.orgE. +This manual page was written by Denis Barbier +Ebarbier@debian.orgE for the Debian system (but may be used by +others) by cut'n'paste from original documentation written in HTML. + --- linklint-2.3.5.orig/debian/rules +++ linklint-2.3.5/debian/rules @@ -0,0 +1,53 @@ +#!/usr/bin/make -f +# Sample debian/rules that uses debhelper. +# GNU copyright 1997 to 1999 by Joey Hess. + +# Uncomment this to turn on verbose mode. +#export DH_VERBOSE=1 + +configure: configure-stamp +configure-stamp: + dh_testdir + touch configure-stamp + +build: configure-stamp build-stamp +build-stamp: + dh_testdir + pod2man --section=1 debian/linklint.pod > linklint.1 + touch build-stamp + +clean: + dh_testdir + rm -f build-stamp configure-stamp + rm -f linklint.1 + dh_clean + +install: build + dh_testdir + dh_testroot + dh_clean -k + dh_installdirs + install -m 755 linklint-* $(CURDIR)/debian/linklint/usr/bin/linklint + install -m 644 linklint.1 $(CURDIR)/debian/linklint/usr/share/man/man1 + install -m 644 doc/*.html doc/*.gif $(CURDIR)/debian/linklint/usr/share/doc/linklint/html + +# Build architecture-dependent files here. +binary-arch: build install +# We have nothing to do by default. + +# Build architecture-independent files here. +binary-indep: build install + dh_testdir + dh_testroot + dh_installdocs READ_ME.txt + dh_installman + dh_installchangelogs CHANGES.txt + dh_compress + dh_fixperms + dh_installdeb + dh_gencontrol + dh_md5sums + dh_builddeb + +binary: binary-indep binary-arch +.PHONY: build clean binary-indep binary-arch binary install configure --- linklint-2.3.5.orig/linklint-2.3.5 +++ linklint-2.3.5/linklint-2.3.5 @@ -440,7 +440,7 @@ -netset ... and update last modified status in the cache. -retry Only check urls that had host failures. -flush Remove urls from cache that aren't currently being checked. - -checksum Exaustive check of modified status. + -checksum Exhaustive check of modified status. -cache dir Read/save "linklint.url" cache file in this directory. Output: @@ -526,7 +526,7 @@ "html::\.($HtmlExts)\$", 'java archive::\.jar$', 'map::\.map$', - 'image::\.(gif|jpg|jpeg|tif|tiff|pic|pict|hdf|ras|xbm)$', + 'image::\.(png|gif|jpg|jpeg|tif|tiff|pic|pict|hdf|ras|xbm)$', 'text::\.txt$', 'audio::\.(au|snd|wav|aif|aiff|midi|mid)$', 'video::\.(mpg|mpeg|avi|qt|mov)$', @@ -636,6 +636,7 @@ $MiscFlags = join "|", qw/ case + checksum concise_url error flush @@ -1306,7 +1307,7 @@ # LinkRemote($link, $referer) # # Checks $link via http. If it is an html file it is parsed and -# the results go into local lists maintained by LinkLink(). +# the results go into local lists maintained by LinkLint(). #-------------------------------------------------------------------------- sub LinkRemote @@ -2472,7 +2473,7 @@ $TotFail = $UrlFail; $CheckedUrls && print - &Plural($CheckedUrls, "Linklink checked %d url%s:\n"), + &Plural($CheckedUrls, "Linklint checked %d url%s:\n"), &Plural($UrlOk," %d %w ok, "), $TotFail, " failed", $UrlMoved ? &Plural($UrlMoved, ". %N url%s moved") : '',