Search the FAQ Archives

3 - A - B - C - D - E - F - G - H - I - J - K - L - M
N - O - P - Q - R - S - T - U - V - W - X - Y - Z - Internet FAQ Archives

Artificial Intelligence FAQ: FTP Resources 7/7 [Monthly posting]

( Part1 - Part2 - Part3 - Part4 - Part5 - Part6 - Part7 - MultiPage )
[ Usenet FAQs | Web FAQs | Documents | RFC Index | Cities ]
Archive-name: ai-faq/general/part7
Posting-Frequency: monthly
Last-Modified: Fri Mar 19 13:37:08 PST 1999 by Ric Crabbe
Version: 2.0
Maintainer: Ric Crabbe <> and Amit Dubey <>
Size: 84098 bytes, 1871 lines

See reader questions & answers on this topic! - Help others by sharing your knowledge
Part 7: (FTP Resources):
  [7-1] AI Bibliographies available by FTP and WWW
  [7-2] Technical Reports available by FTP and WWW
  [7-3] Where can I get a machine readable dictionary, thesaurus, and
        other text corpora?
  [7-4] List of Smalltalk implementations.
  [7-5] AI-related CD-ROMs

Subject: [7-1] AI Bibliographies available by FTP


   The Computer Science Department at the University of Saarbruecken, Germany,
   maintains a large bibliographic database of articles pertaining to the
   field of Artificial Intelligence. Currently the database contains more
   than 25,000 references, which can be retrieved by electronic mail from
   the LIDO mailserver at Send a mail message with
   subject line "lidosearch help info" to get instructions on using the
   mail server. A variety of queries based on author names, title and
   year of publication are possible. The references can be provided in
   BibTeX or Refer formats. The entire bibliographic database can be
   obtained for a fee by ftp or on tape. Questions may be directed to

   A variety of AI-related bibliographies are available by anonymous ftp

[Maintainer's note: nexus doesn't seem to be accepting anonymous logins]
[anymore.  Does anyone have a new link?]

   Stanford University (SUMEX-AIM) has a large BibTeX bibliography of
   Artificial Intelligence papers and technical reports. Available by
   anonymous ftp from{1,2,3}.bib

[Maintainer's note: this one doesn't seem to be working, either]

   A large collection of BibTeX bibliographies (290,000+ references) on a
   variety of subjects, including artificial intelligence (29,402
   entries), neural networks (8,111 entries), and object-oriented
   programming (3,493 entries), is available by anonymous ftp from  []

   and in the mirror sites

   or by WWW from

   Some of the bibliographies prohibit commercial use. For more
   information, see the README file, or write to Alf-Christian Achilles
   <> or <>. 

   Glimpse, a searchable interface to the UKA and other
   bibliographies, is accessible as

   Write to for more information.

   OFAI Library Bibliography, in Austria

Fuzzy Logic:
   A BibTeX database of references addressing neuro-fuzzy issues can be
   obtained by anonymous ftp from []

   as the (ascii) file fuzzy-nn.bib.

Genetic Algorithms:

   A bibliography of over 400 Evolutionary Computation references (GA,
   ES, EP, GP) is available by anonymous ftp from
      [Maintainer's note: this seems to be out-of-date]

   The file EC-ref.bib.Z is in BibTeX format; is a postscript
   version of the bibliography. Please send additions and corrections to or

   Other Genetic Algorithm bibliography sites include:

Logic Programming, Constraints:

   A BibTeX bibliography for Constraint Logic Programming is available
   by anonymous ftp from

   in the bib/ and papers/ subdirectories.


   For information on a fairly complete bibliography of computational
   linguistics and natural language processing work from the 1980s, send
   mail to with the subject HELP. 

   The CSLI linguistics bibliography contains 3,300 entries in
   bib/tib/refer format. The bibliography is heavily slanted towards
   phonetics and phonology but also includes a fair amount of
   computational morphology, syntax, semantics, and psycholinguistics.
   The bibliography can be used with James Alexander's tib
   bibliography system, which is available from
   [] among other places. The bibliography itself is available
   by anonymous ftp from

   Contributions are welcome, but should be in tib format.
   For more information, contact Andras Kornai <>


   Robert Dale's Natural Language Generation (NLG) bibliography is
   available by anonymous ftp from [] 

   Note that it is formatted for A4 paper. Stick in a line 
      .94 .94 scale
   after the %! line to print on 8.5 x 11 paper. For further information,
   write to Robert Dale, University of Edinburgh, Centre for Cognitive
   Science, 2 Buccleuch Place, Edinburgh EH8 9LW Scotland, or
   <> or <>.

   Mark Kantrowitz's Natural Language Generation (NLG) bibliography is
   available by anonymous ftp from [] 

   In addition to the tech report, the BibTeX file containing the
   bibliography is also available.  The bibliography contains more than
   1,200 entries. A searchable index to the bibliography is
   available via the URL

   Additions and corrections should be sent to 

Neural Nets, Learning:

   A bibliography of over 1000 entries about Self-Organizing Map
   (SOM) and Learning vector Quantization (LVQ) studies is
   available by anonymous ftp from

   as the files references.bib.Z (BibTeX file) and
   (PostScript file). Please send additions and corrections to

   An extensive collection of references on Principal Component Analysis
   (PCA) neural networks and learning algorithms is available by
   anonymous ftp from in LaTeX and PostScript
   formats. The list was compiled by Liu-Yue Wang, a graduate student of
   Erkki Oja, and updated by Juha Karhunen, all from Helsinki University
   of Technology, Finland.  For more information, contact Erkki Oja

   A bibliography of PCA algorithms is available by anonymous ftp from as pca.bib. For more information,
   contact Terry Sanger <>.

   A 36-page bibliography of connectionist models with symbolic
   processing is available by anonymous ftp from Neuroprose []

   as the file For more information, contact
   Ron Sun <>.

Nonmonotonic Logic, Belief Revision:

   A bibliography on belief revision and nonmonotonic logics with
   about 2,000 items is available by anonymous ftp from  []

   as nonmono.bib or nonmono.bib.Z. The file is also available by WAIS as


   and by gopher/WWW. Please send additions and corrections to Raymundo
   Morado <>. 


   A bibliography of papers on Silicon Auditory Models (VLSI
   implementations of auditory representations) is available by anonymous
   ftp from

   For more information, write to John Lazzaro <>

Subject: [7-2] Technical Reports available by FTP This section lists the anonymous ftp sites for technical reports from several universities and other organizations. Some of the sites provide only an online catalog of technical reports, while the rest make the actual reports available online. The email address listed is that of the appropriate person to contact with questions about ordering technical reports. When ftping compressed .Z files, remember to set the transfer type to binary first, using the command ftp> binary Other general locations for technical reports from several universities include: [] (see Index for an index) AKA [] The uwaterloo archive includes tech reports from the Logic Programming and Artificial Intelligence Group (LPAIG) of the University of Waterloo. There is also a WAIS server containing tech report abstracts that can be searched. To use, create the file ~/wais-sources/cs-techreport-abstracts.src containing (:source :version 3 :ip-address "" :ip-name "" :tcp-port 210 :database-name "cs-techreport-abstracts" :cost 0.00 :cost-unit :free :maintainer "") and invoke your local wais client. To add to it, email abstracts of your papers to in the following format: %TI Title %AU Author (use multiple %AU lines for multiple authors) %PU Published In (citation information) %AV Availability (e.g., ftp %OR Organization (see cs-techreport-archives.src for institution codes) %LT Local title (e.g., tech report number) %DA Date (and, if you want, %MN Month, %YR Year) %AB Abstract If your papers are not available by FTP, you can use a %AV line such as: %AV mail Further instructions are available from [Based on a post by Ashwin Ram.] Also see the Unified Computer Science Technical Report Index A list of FTP sites for technical reports and papers can be found in A list of more than 230 sites publishing CS tech reports may be obtained by anonymous ftp from To receive notification of new tech report sites, send mail to to join the mailing list. An archive of linguistics papers and preprints is available from Contact John Lawler ( or for more information. The Concurrent Engineering Research Center (CERC) at West Virginia University has placed ASCII versions of the concurrent engineering-related abstracts (over 500) that were on CERCnet, ASCII back issues of the Concurrent Engineering Research in Review journal (now discontinued), and Postscript copies of CERC technical reports in the gopher server In addition, many of the CERC technical reports, including journal articles, symposium papers, theses, dissertations, and issues of the Concurrent Engineering Research in Review journal, are available as Postscript versions via anonymous ftp from [] An index to all the reports, including some that are available only in hardcopy, is contained in the file "CERC-TR-INDEX". If you need additional information, contact Mary Carriger, CERC Office of Information Services, at The newsgroup comp.doc.techreports is devoted to distributing lists of tech reports and their abstracts. MIT Artificial Intelligence Laboratory: ftp -- email -- browse -- telnet www -- A full catalog of MIT AI Lab technical reports (and a listing of recent updates) may be obtained from the above location, by writing to Publications, Room NE43-818, M.I.T. Artificial Intelligence Laboratory, 545 Technology Square, Cambridge, MA 02139, USA, or by calling 1-617-253-6773. The catalog lists the technical reports ("AI Memos") with a short abstract and their current prices. There is also a charge for shipping. Some recent tech reports (since 1991) are available in the ai-publications/ subdirectory; older technical reports are NOT available by ftp. A bibliography is in the bibliography/ directory. CMU School of Computer Science: ftp -- email -- www -- CMU Software Engineering Institute: ftp -- email -- www -- Yale: ftp -- University of Washington CSE Tech Reports: ftp -- email -- ================ AT&T Bell Laboratories: ftp -- bib.Z contains short bibliography, including all the technical reports contained in this directory. ftp -- [Maintainer's note: I assume these have been moved over to Lucent's domain?] Argonne National Laboratory: ftp -- email -- Contains MCS Division preprints and technical memoranda, available as either .dvi or .ps files. For descriptions of the contents, see the subdirectory pub/tech_reports/abstracts; for the files themselves see the subdirectory pub/tech_reports/reports. Boston University: ftp -- email -- Brown University: ftp -- email -- Cambridge University: Speech, Vision & Robotics Group ftp -- Columbia University: ftp -- email -- DEC Cambridge Research Lab: ftp -- DEC Paris Research Lab: email -- Put commands in Subject: line of the message. To get a list of articles, use send index articles To get a list of tech reports, use send index reports DEC WRL: email -- To get a helpfile, send a message with help in the subject line. DFKI: ftp -- email -- Martin Henz ( Duke University: ftp -- email -- [unknown user, 7/7/93] Edinburgh: A list of available reports can be sent via email. Send requests for information about reports from the Center for Cognitive Science to, and from the Human Communication Research Center to Electrotechnical Laboratory, Japan: Reports from the Cooperative Architecture project (half AI, half software engineering). ftp -- [] See file Index.English. email -- Hideyuki Nakashima <>. Georgia Tech College of Computing, AI Group: ftp -- ( email -- Professor Ashwin Ram <> HCRC (Human Communication Research Centre): ftp -- mail -- Fiona-Anne Malcolm Human Communication Research Centre 2 Buccleuch Place, Edinburgh, UK Illinois: email -- Erna Amerman <> Illinois Genetic Algorithms Laboratory (IlliGAL): email -- Eric Thompson <> phone -- 217-333-2346 (9AM to 5PM CT, M-F) mail -- Illinois Genetic Algorithms Laboratory Department of General Engineering 117 Transportation Building 104 South Mathews Avenue Urbana, IL 61801-2996 ftp -- Includes the GA bibliography and the Messy GA code in C (in /pub/src/) and preprints (in /pub/papers/Publications) www -- Indiana: ftp -- [] ftp -- [] INRIA, France: ftp -- Institute for Learning Sciences at Northwestern University: ftp -- phone -- 708-491-3500 Mechanized Reasoning Group (MRG): ftp -- email -- Fausto Giunchiglia <> Mechanized Reasoning Group, IRST 38050 Povo Trento, Italy Tel: +39 461-314444 (secr.) +39 461-314436 (office) Fax: +39 461-302040 / 314591 National University of Singapore: ftp -- New York University (NYU): ftp -- OGI: ftp -- email -- Ohio State University, Laboratory for AI Research ftp -- email -- OSU Neuroprose: ftp -- ( This directory contains technical reports as a public service to the connectionist and neural network scientific community which has an organized mailing list (for info: Includes several bibliographies. Stanford: ftp -- Very spotty collection. SRI: email -- Donna O'Neal, SUNY Buffalo: ftp -- SUNY at Stony Brook: ftp -- email -- or The /pub/sunysb directory contains the SB-Prolog implementation of the Prolog language. Contact for more information. TCGA (The Clearinghouse for Genetic Algorithms): email -- Robert Elliott Smith <> Department of Engineering of Mechanics Room 210 Hardaway Hall The University of Alabama PO Box 870278 Tuscaloosa, AL 35487 205-348-1618, fax 205-348-6419 Thinking Machines: ftp -- This file contains a list of Thinking Machines technical reports. Orders may be placed by email (limit 5) to, or by US Mail to Thinking Machines Corporation, Attn: Technical reports, 245 First Street, Cambridge, MA 01241. In addition, the directories cm/starlisp and cm/starlogo contain code for the *Lisp and *Logo simulators. Tulane University: ftp -- [] University of Alabama: ftp -- University of Arizona: ftp -- email -- The directory /japan/kahaner.reports contains reports on AI in Japan, among other things, written by Dr. David Kahaner, a numerical analyst on sabbatical to the Office of Naval Research-Asia (ONR Asia) in Tokyo from NIST. The reports are not written in any sort of official capacity, but are quite interesting. University of California/Los Angeles: ftp -- University of California/Santa Cruz: ftp -- email -- University of Cambridge Computer Lab: email -- University of Colorado: ftp -- University of Florida: ftp -- University of Genoa, Mechanized Reasoning Group: ftp -- email -- Fausto Giunchiglia <> University of Georgia: ftp -- University of Illinois at Urbana: ftp -- email -- University of Indiana, Center for Research on Concepts and Cognition: ftp -- email -- University of Kaiserslautern, Germany: ftp -- University of Kentucky: ftp -- University of Massachusetts at Amherst: email -- University of Melbourne, Australia, Computer Vision and Pattern Recognition Laboratory (CVPRL): ftp -- University of Michigan: ftp -- University of North Carolina: ftp -- University of Pennsylvania: ftp -- email -- [email bounced 7/7/93] USC/Information Sciences Institute: email -- Sheila Coyazo <> is the contact. [email bounced 7/7/93] University of Toronto: ftp -- (Cognitive Robotics) email -- University of Virginia: ftp -- University of Western Australia: ftp -- Centre for Intelligent Information Processing Systems (CIIPS) EE Engineering Department University of Wisconsin: ftp -- email -- Some AI authors have set up repositories of their own papers: Matthew Ginsberg:
Subject: [7-3] Where can I get a machine readable dictionary, thesaurus, and other text corpora? Free: /usr/dict/words Roget's 1911 Thesaurus is available by anonymous FTP from the Consortium for Lexical Research [] It is also available from An old Webster's dictionary is in /text/dict/{DICT.Z,DICT.INDEX.Z}. Project Gutenberg also has Roget's 1911 Thesaurus. The Project Gutenberg archive is at The Project Gutenberg archive collects public domain electronic books. For more information, write to Michael S. Hart, Professor of Electronic Text, Executive Director of Project Gutenberg Etext, Illinois Benedictine College, 5700 College Road, Lisle, IL 60532 or send email to For people without FTP, Austin Code Works sells floppy disks containing Roget's 1911 Thesaurus for $40.00. This money helps support the production of other useful texts, such as the 1913 Webster's dictionary. The Online Book Initiative maintains a text repository on (a public access UNIX system, 617-739-WRLD). See the README file on For more information, send email to, write to Software Tool & Die, 1330 Beacon Street, Brookline, MA 02146, or call 617-739-0202. The CHILDES project at Carnegie Mellon University has a lot of data of children speaking to adults, as well as the adult written and adult spoken corpora from the CORNELL project. Contact Brian MacWhinney <> for more information. The Association for Computational Linguistics (ACL) has a Data Collection Initiative. For more information, contact Donald Walker at Bellcore, Two lists of common female first names (4967 names) and male first names (2924 names) are available for anonymous ftp from Read the file README first. Send mail to for more information. A list of 110,000 English words (one per line, in ASCII) is available in the PD1:<MSDOS.LINGUISTICS> directory on SIMTEL20 as the files WORDS1.ZIP, WORDS2.ZIP, WORDS3.ZIP, and WORDS4.ZIP. Although the list is in MS-DOS files, it can easily be used on other machines (but first you'll have to unzip the files on a DOS machine). The list includes inflected forms of the words, such as plural nouns and the -s, -ed, and -ing forms of verbs; thus the number of lexical stems in the list is considerably smaller than the total number of word forms. These files are available via FTP from WSMR-SIMTEL20.ARMY.MIL []. SIMTEL20 files are mirrored on The Collins English Dictionary encoded as a Prolog fact base is available from the Oxford Text Archive by anonymous ftp from [] The Oxford Text Archive includes many other texts, dictionaries, thesauri, word lists, and so on, most of which are available for scholarly use and research only. See the files for more information, or write to, Oxford Text Archive, Oxford University Computing Services, 13 Banbury Road, Oxford OX2 6NN, UK, call 44-865-273238 or fax 44-865-273275. Chuck Wooters <> has extracted the most likely pronunciation for each of about 6100 words in the hand-labeled TIMIT database, and made them available by anonymous ftp from A list of homophones from general American English is available by anonymous ftp from as the file homophones-1.01.txt. To receive the list by email, send mail to The list was compiled by Tony Robinson. Sigurd P. Crossland <> has been compiling a dictionary of English words, including most common American words, abbreviations, hyphenations, and even incorrect spellings. The most recent version is available by anonymous ftp from The tar file includes 31 text files, one for each word-length from 2 to 32. The compressed tar file takes up just over 4mb of space, and includes approximately 870,000 words. WordNet is an English lexical reference system based on current psycholinguistic theories of human lexical memory. It organizes nouns, verbs and adjectives into synonym sets corresponding to lexical concepts. The sets are linked by a variety of relations. Besides being of scientific interest, it makes a handy thesaurus. WordNet is available by anonymous ftp from If you retrieve a copy of wordnet by ftp, please send mail to Commercial: Illumind publishes the Moby Thesaurus (25,000 roots/1.2 million synonyms), Moby Words (560,000 entries), Moby Hyphenator (155,000 entries), and the Moby Part-of-Speech (214,000 entries), Moby Pronunciator (167,000 entries with IPA encoding, syllabification, and primary, secondary, and tertiary stress marks) and Moby Language (100,000 word word lists in five major world languages) lexical databases. All databases are supplied in pure ASCII, royalty-free, in both Macintosh and MS-DOS disk formats (also in .Z file formats). Both commercial (to resell derived structures as part of commercial applications) and educational/research licenses are available. Samples of each of the lexical databases are available by anonymous ftp from []. For more information, write to Illumind, Attn: Grady Ward, 3449 Martha Court, Arcata, CA 95521, call/fax 707-826-7715, or send email to [Maintainer's note: This contact information is no longer valid. We're working on finding a current address.] The Oxford Text Archive has hundreds of online texts in a wide variety of languages, including a few dictionaries (the OED, Collins, etc.). The Lancaster-Oslo-Bergen (LOB), Brown, and London-Lund corpii are also available from them. For more information, write to Oxford Electronic Publishing, Oxford University Press, 200 Madison Avenue, New York, NY 10016, call 212-889-0206, or send mail to (Their contact information in England is Oxford Text Archive, Oxford University Computing Service, 13 Banbury Road, Oxford OX2 6NN, UK, +44 (865) 273238.) Mailing Lists: CORPORA is a mailing list for Text Corpora. It welcomes information and questions about text corpora such as availability, aspects of compiling and using corpora, software, tagging, parsing, and bibliography. To be added to the list, send a message to Contributions should be sent to Linguistic Data Consortium: The Linguistic Data Consortium was established to broaden the collection and distribution of speech and natural language data bases for the purposes of research and technology development in automatic speech recognition, natural language processing, and other areas where large amounts of linguistic data are needed. Information about the LDC is available by anonymous ftp from []. Documents available in this directory include a paper on the background, rationale and goals of the LDC, a brief list of available data bases, and some tables summarizing these corpora. For further information, contact Elizabeth Hodas, <>, Mark Liberman <>, or Jack Godfrey <>.
Subject: [7-4] List of Smalltalk implementations. Little Smalltalk -- Tim Budd's version of Smalltalk GNU Smalltalk
Subject: [7-5] AI-related CD-ROMs Prime Time Freeware for AI: Prime Time Freeware for AI is an annual CD-ROM collection of Artificial Intelligence freeware source code and documentation. Prime Time Freeware for AI in no way modifies the legal restrictions on any package it includes. Each issue consists of two ISO-9660 CD-ROMs, bound into a 224 page book. The current issue (1-1; July 1994) includes a selection of the contents of the CMU AI Repository (see [5-1]), including most of the AI Programming Languages section and most of the AI Software Packages section. Thus the CD-ROMs contain nearly every free implementation of Lisp, Prolog, Scheme, and Smalltalk, including graphical user interfaces, object-oriented programming extensions, and other software development tools. They also contain the most complete collection of free software in every area of artificial intelligence research and practice, including Artificial Life, Expert Systems, Fuzzy Logic, Genetic Algorithms, Knowledge Representation, Machine Learning, Natural Language Understanding and Generation, Neural Networks, Planning, Reasoning, Speech Recognition and Synthesis, and Theorem Proving, and much, much more. All of the more than 1,300 packages are extensively annotated and indexed, with programs for searching the index included on the CDs. Since the CD-ROMs use gzip for compression, this means that Prime Time Freeware for AI contains more than 5,000 megabytes of AI-related software. Prime Time Freeware for AI is targeted at AI researchers, educators, students, and practitioners. Prime Time Freeware for AI is particularly useful for programmers who do not have FTP access, but may also be useful as a way of saving disk space and avoiding annoying FTP searches and retrievals. Prime Time Freeware helped establish the CMU AI Repository, and sales of Prime Time Freeware for AI will continue to help support the expansion and maintenance of the repository. The product sells (list) for $60 US plus applicable sales tax and shipping and handling charges. Payable through Visa, Mastercard, postal money orders in US funds, and checks in US funds drawn on a US bank. Thus Prime Time Freeware for AI offers more than twice the contents of the NCC AI CD-ROM. For more information write to Prime Time Freeware 370 Altair Way, Suite 150 Sunnyvale, CA 94086 USA Tel: 408-433-9662 Fax: 408-433-0727 E-mail: NCC AI CD-ROM: The AI CD-ROM Revision 3 is available from Network Cybernetics Corporation for $89.00 per copy (plus $3 shipping domestic, $8 shipping international). The AI CD-ROM is an ISO-9660 format disk usable on any computer system, and contains a variety of public domain, shareware, and other software of special interest to the AI community. The disk contains source code, executable programs, demonstration versions of commercial programs, tutorials and other files for a variety of operating systems. Among the supported operating systems are DOS, OS/2, Mac, Amiga, and Unix. Among the items included are the latest version NASA software such as CLIPS v6, NETS, and SPLICER, the collected source code from AIExpert magazine from the premier issue in June of 1986 to the present, and complete transcriptions of the first annual Loebner Prize competition. It also includes examples many different kinds of neural networks, genetic algorithms, artificial life simulators, natural language software, public domain and shareware compilers for a wide range of languages such as Lisp, Xlisp, Scheme, XScheme, Smalltalk, Prolog, ICON, SNOBOL, and many others. Complete collections of the Neural Digest, Genetic Algorithms Digest, and Vision List Digest are included. Most files on the disk are compressed in ZIP format. Macintosh specific files are in BinHex v4 (.HQX) format. Network Cybernetics Corporation releases annual revisions to the AI CD-ROM to keep it up to date with current developments in the field. For more information, write to Network Cybernetics Corporation, 4201 Wingren Road, Suite 202, Irving, Texas 75062-2763, call 214-650-2002, fax 214-650-1929, or send email to ---------------------------------------------------------------- --- [ is moderated. To submit, just post and be patient, or if ] [ that fails mail your article to <>, and ] [ ask your news administrator to fix the problems with your system. ]

User Contributions:

Apr 4, 2023 @ 1:13 pm
You actually said it wonderfully!
english essay writer unique college essay

Comment about this article, ask questions, or add new information about this topic:

Part1 - Part2 - Part3 - Part4 - Part5 - Part6 - Part7 - MultiPage

[ Usenet FAQs | Web FAQs | Documents | RFC Index ]

Send corrections/additions to the FAQ Maintainer:

Last Update March 27 2014 @ 02:11 PM