Full Description of File
CD-ROM Duplicate Analyzer v1.3
Will read "allfiles" listing for 2 CD-ROM
discs and quickly produce a listing of
duplicate files found between each, and
duplicates found within the disc. Want to
know if that new CD wil duplicate what you
already have? Use CDDA to prepare analysis.
Supports 20,000 files/CD list. 08/24/93
Author:Robert Neal NOT CRIPPLED!
Contents of the CDDA.DOC file
Corporate Headquarters BBS (815) 886-3233
CD-ROM DUPLICATE ANALYZER
by Robert Neal
317 Hickory Avenue
Romeoville, IL 60441
THIS SOFTWARE AND ACCOMPANYING MATERIALS ARE DISTRIBUTED "AS IS" WITHOUT
CHARGE AND WITHOUT WARRANTY, EXPRESS, IMPLIED OR STATUTORY, INCLUDING BUT NOT
LIMITED TO ANY IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
PARTICULAR PURPOSE. IN NO EVENT SHALL ANYONE INVOLVED WITH THE CREATION AND
PRODUCTION OF THIS PRODUCT BE LIABLE FOR INDIRECT, SPECIAL, OR CONSEQUENTIAL
DAMAGES, ARISING OUT OF ANY USE THEREOF OR BREACH OF ANY WARRANTY.
It used to be simple, a new CD ROM Shareware disc would be released
about every 4-6 months. Now though, we have several key players in the
market, all releasing discs that overlap files with one another. Before
plunking down your cash, you ask "what's the duplication rate with brand
I originally wrote CD-ROM Duplicate Analyzer for my own personal use.
I had an interest in just what the number of duplicate files between two
CD's were, after all, no sense in purchasing a disc that duplicates what
I already have available. The program was crude, but it did what I
wanted, quickly and easily. I, in turn, posted my "findings" in the
CD-ROM conference on RIME (a national message network). Surprisingly, I
had people asking where they could get the utility that I used to create
the analysis. I even had several CD ROM distributors contact me directly
asking if I could make the program available to them.
With all this positive feedback and encouragement on an unreleased
program, I decided to put some time behind the keyboard to polish up my
code for public release. The time spent polishing and enhancing the code
has taken more time than the original crude version. The end result is a
program that I still think is easy to use and retains it's speed. The
response to this initial shareware offering will serve as encouragement
to continue to release future enhanced versions as well as releasing
WHAT THE PROGRAM DOES
Quite simply, CDDA reads two file listings. The only requirement of the
file listing is that the filename, with extension, be the first field in
each line. Most "allfiles" listings are in this format. There is no need
to strip out any information from the file listing, headers, comments,
additional file comment lines will all be ignored.
If you only have a listing of individual DIRxx files, typically for
PCBoard systems, these can be quickly combined into one file listing for
processing by CDDA by using the batch file, CDCLIST.BAT, included with
this program. For other formats, you can create an "allfiles" listing
using utility programs specific for your BBS file formats. The batch
file simply performs a recurrsive loop and using the DOS copy function,
copies each DIRxx file out to a filename CDDA.LST. This filename can be
renamed by you.
After reading a file listing, CDDA will perform an internal sort on the
filenames. The sort routine is quite fast. In my tests on a 386/25
system, 5,000 filenames are able to be sorted in about 15 seconds.
Once the filenames are sorted, a temporary file is created. CDDA will
then compare, on a like filename basis, filenames in both listings. To
be considered a "duplicate", the filenames must be exactly the same. One
benefit with CDDA is that it ignores the file extension, so if a file is
CD1 is listed as TESTPROG.EXE and listed in CD2 as TESTPROG.ZIP, CDDA
will mark these two as duplicates.
In addition to comparing duplicates between each other, CDDA will also
review the individual file listings for duplicates within themself at
the same time.
CDDA also can compare file sizes to determine if files are duplicates by
using the /VAR: command explained below.
Currently, CDDA does no further evaluation to verify if a indeed a
duplicate. CDDA will log all found duplicates which you may then review
manually if required.
Type CDDA at the command line. You will then be prompted for the
Disc Name #1 - used as a reference in the report output, enter whatever
is useful for you
File Listing Name #1 - The name of your "allfiles" file list for disc #1
Disc Name #2 - Again, used only for your reference
File Listing Name #2 - The other "allfiles" listing to be compared with
Report Name - Name of the output report you wish to be created, may
enter LPT1 to send to printer
After you have the above information, simply press F10 and you'll be on
your way. You may press ESC during the file listing read routine to
abort the process if necessary. If you leave Disc Name #1 blank the
program will also terminate. The program will now begin reading and
processing the data, quickly and easily.
That's all there is to it.
Here's a sample of the report output:
Analysis: 05-22-1993 20:22:28
CD1: NIGHT OWL 8
CD2: PIER 2
93_WPCAL Duplicate file
[..text deleted to save space..]
XWING Duplicate file
_1WOLF14 Duplicate file
_FDB402C Duplicate file
Total Files NIGHT OWL 8: 4945
Total Files PIER 2: 4740
Shared Duplicates: 42 ( .86%)
Dupes within NIGHT OWL 8: 12 ( .24%)
Dupes within PIER 2: 0 ( 0%)
You can include the following variables on the command line when running
the program, ie CDDA /REC:x /VAR:x. These commands do not need to be
entered as CDDA will default the values shown below.
Indicates the maximum number of filenames contained in a file
listing. This is used to allocate space to handle the internal
sort routine. The default is 12,500 and should be ample for most
needs. This has been tested up to 20,000 records which should
accomodate every need. Users who are limited on memory may need
to reduce this value (especially if the number of files is
significantly less than 12,500) to conserve memory space.
Indicates the file size variable that you would like to use for a
"duplicate". For example, if you entered /VAR:100, this would
instruct the program that the file size would have to be a
perfect match (in "K" as 241,000 and 241,724 would still be
considered a match). The default value is 0%, meaning the file
size can be any variance and still be considered a match (for
example, if a filename of two files were the same and one was
1,000 bytes and another was 200,000 bytes, CDDA would consider
these to be a duplicate. You can set to any value between 0% and
CD-ROM DUPLICATE ANALYZER is released as SHAREWARE. You are granted a
limited license to evaluate the program. Try it out, if you continue to
use it you are required to register the program. Your registration will
serve as encouragement for me to continue to release updates and
You will notice that I have NOT crippled the program in anyway. There is
a pause at the start up of the program to remind people that the program
is not registered. The registrations thus far have been few, yet I know
that there are CD vendors using this program. Isn't it amazing that the
vendors expect to be paid for selling a shareware disc, but they don't
support a program that they use. I debated on whether to restrict the
unregistered program in some form and decided to just go on the trust of
users who may use the program. I know I won't get rich, but if you can
use this prorgam to sell your product or make an informed buying
decision, you are expected to purchase the right to use this program.
Off the soap box, here's the registration fees:
To register the program send $10.00 along with your name and
address to: Robert Neal, 317 Hickory Avenue, Romeoville, IL
Special offer - register CDDA along with another program of
mine, CD-ROM File Date Analyzer for only $5.00 more.
To register the program for commercial use, send $20.00 along
with your business name and address to the address above.
Special offer for commercial users - register CDDA, and you can
also register another program of mine, CD-ROM File Date Analyzer
for only $10.00 more.
CD vendors/distributors may send a copy of your CD disc in lieu
of payment and receive registration numbers for both programs. I
enjoy reviewing new discs and post my reviews on the RIME
message network, a large international message network with
nearly 1,000 different BBS's participating. The review can reach
a large potential group of consumers, eager to purchase your
product. Any CD's received will be reviewed on an impartial
basis. If you feel your CD is of quality, here's your chance to
get some free "promotion" for your product and a free copy of
BOTH programs to boot!
CONTACT ME FOR SUPPORT
If you encounter any problems you may contact me at Corporate Headquarters
BBS (815) 886-3233 or (815) 886-9381.
If you have any suggestions for improvements for this program, please feel
free to let me know.
You can now register online with Visa or MasterCard. Call the support
BBS listed above and type CHARGE at the main menu prompt.
v1.0 - Initial release
v1.1 - Not released
v1.2 - Not released
v1.3 - Added .JPG, BMP, PCX, MPG, TGA, TGZ and PCX to the valid file
Added routine to verify duplicate file by comparing filesizes,
added the /VAR: command for this.
Greatly expanded the size of the filenames that could be sorted
from 7,500 to over 20,000 records. The default is set to 12,500
which should be more than ample, but this can be changed using
the /REC: option.