IBM Intelligent Miner for Text for OS/ Version 2 Release 3 Enables You to Extract Key Information Efficiently from Large Quantities of Text
Announcement Letter Number:
Table of Contents:At a Glance
IBM Intelligent Miner for Text offers system integrators, solution providers, and application developers a wide range of sophisticated text analysis tools, an extended full-text search engine, enhanced with text mining functions, and a Web Crawler to enrich business intelligence, content management, and knowledge management solutions.
Intelligent Miner for Text for OS/ V offers the following key features:
- Text Analysis Tools
- Language identification to discover the language of a document
- Clustering to group related documents by contents
- Categorization to assign documents to a set of pre-defined categories
- Summarization of documents
- Feature extraction to identify key elements of free-text
- Extended Text Search Engine to search textual information and to uncover related concepts with Java-based samples for GUI application development
- Web Crawler package consisting of a toolkit and a ready-to-run Web Crawler
- Flexible and ready-to-use NetQuestion Web-search Solution
- Full-product, limited-time trial version for workstations at no charge
For ordering, contact: Your IBM representative, an IBM Business Partner, or IBM North America Sales Centers at IBM-CALL Reference: LE
IBM Intelligent Miner (TM) for Text is a knowledge discovery software development toolkit. It contains tools for application programmers who want to build applications to extract key information from very large quantities of documents, e-mails, or Web pages stored online, often in the Internet or intranets, without having to read them all. With IBM Intelligent Miner for Text, you can:
- Organize the documents by subject, find the predominant themes in a collection of documents, and summarize them.
- Search for relevant documents using powerful and flexible queries -- and more.
Intelligent Miner for Text for OS/ (R) Version extends platform support from AIX (R), Sun Solaris, and Windows NT (R) to the mainframe and features three major components:
- IBM Text Analysis Tools: Include a Language Identification tool, comprehensive Clustering tools, a Topic Categorization tool, a Summarization tool, and Feature Extraction tools. These tools identify document language, group conceptually related documents, classify documents by content, generate document summaries, and extract key elements of text.
- IBM Text Search Engine: Extends the capabilities of the Text Search Engine provided with the OS/ operating system, transforming it into a comprehensive search engine which is customizable for either sophisticated full-text search (including text mining functions) or Web-tuned search functions. It is enhanced with Java (TM) and Java Beans samples to help build applications for text search and administrative functions accessible from a Java-enabled browser.
- IBM Web Crawler Package: Consists of a ready-to-run Web Crawler and a Web Crawler toolkit to build customized Web crawlers.
The Intelligent Miner for Text toolkit also features an enhanced IBM NetQuestion Solution, a powerful Internet/intranet text-search solution, based on the Text Search Engine and Web Crawler, which extends the search scope of the OS/provided NetQuestion Solution from a local OS/ domain to many servers across a Web space.
OS/ V, or higher, with the provided UNIX (R) System Services, TCP/IP, and a Web server. The Web Crawler requires DB2 (R) V For details, refer to the Hardware Requirements and Software Requirements sections.
Planned Availability Date
This announcement is provided for your information only. For additional information, contact your IBM representative, call IBM-4YOU, or visit the IBM home page at: lasolidariacr.com
IBM Intelligent Miner for Text for OS/ Version 2 Release 3 is a knowledge discovery software development toolkit providing three major components to build advanced-technology information retrieval and mining applications and a Web-search solution. It consists of:
- IBM Text Analysis tools
- Extended IBM Text Search Engine*
- IBM Web Crawler package
It also provides an enhanced NetQuestion Solution* using the Text Search Engine and the Web Crawler to support your e-business in the form of a drop-in solution. * Base functions of this component and solution are provided with the OS/ operating system.
A description of the complete set of functions available after installation of Intelligent Miner for Text for OS/ follows. Base functions provided with the OS/ operating system are listed in the Base Text Search Functions Provided with OS/ section under Supplemental Information.
Text Analysis Tools: The Text Analysis tools introduce a state-of-the-art toolset for text analysis, text mining, and knowledge management. They can be used to identify the language of documents, intelligently classify documents by content, discover clusters of conceptually related documents, summarize documents to automatically create short descriptions or summaries, and extract key elements of free text. The documents need to be provided in plain text format. For other formats, conversion tools can be obtained from third parties.
- Language Identification
This tool analyzes a document or string, and identifies its language. Testing to date indicates a high rate of accuracy, even on short input. The tool supports the 14 languages -- Catalan, Danish, Dutch, English, Finnish, French, German, Icelandic, Italian, Norwegian Bokmal, Norwegian Nynorsk, Portuguese, Spanish, and Swedish. The language identifier is extensible. A training tool is included that can be used to add a language not yet recognized.
By analyzing key concepts, the Clustering tools automatically find groups of related documents in document collections, such as news feeds, patents, or technical reports. The clusters are created dynamically without requiring a predefined taxonomy. Titles for clusters are generated as short lists of the key concepts that are characteristic for the documents contained in the cluster.
Intelligent Miner for Text includes two different approaches to clustering: binary relational clustering and hierarchical clustering. Binary relational clustering is a top-down approach that splits the collection into clusters at points of maximal difference, while hierarchical clustering is a bottom-up approach that incrementally puts similar documents together in groups. Both approaches usually provide different results when applied to the same data. A practical way of using the tools is to try both approaches with different parameter settings, review the results, and select the one which is most suitable for the task at hand.
The Topic Categorization tool assigns documents to one or more categories from a user-defined taxonomy. Possible applications include automatically sorting documents into a Yahoo-like schema. A training tool is included that allows you to define your own taxonomy and build reliable categorizers for many applications.
The Summarization tool extracts sentences from a document to create a document summary. It works best with well-edited structured documents.
- Feature Extraction
The Feature Extraction tools recognize different kinds of significant items in text, such as proper names, technical terms, relations, or abbreviations.
The training tool creates a scheme of significant features from a document collection. The document extraction tool identifies features in documents either by using a set of extraction functions (exploration mode) or by looking them up in a scheme created by the training tool (lookup mode). Both the training tool and the document extraction tool use the same set of extraction functions based on heuristics and dictionary information.
The exploration mode of the document extraction tool should be used for finding significant features in an isolated document while the lookup mode should be used when rating the content of a given document with respect to the contents of a document collection. In lookup mode, the document extraction tool extracts only those features from a document that also occur in the scheme and provides statistical information about the occurrences of these features in the corresponding collection.
The names extraction function recognizes names even when they occur in different forms, such as "Robert Jordan" versus "Mr. Jordan" and distinguishes between names of persons, organizations, or locations, such as "Houston, Texas" and "Whitney Houston."
When recognizing terminology, the terminology extraction function automatically finds many multiword terms that have a meaning of their own, for example, "laser printer", and recognizes different forms of the same term, such as "expense account" and "expense accounts."
The relation extraction function finds information of the type "R. Jordan is_CIO_of XY Corp.," "XY Corp. produces handheld computers," or "R. Jordan has_age "
The abbreviation extraction function finds and links abbreviations introduced in a text together with their full forms, such as "American Bar Association" versus "ABA."
Other kinds of significant entities are also recognized by other extraction functions, such as dates, numbers, and money amounts (for example "$50," "50 Dollars," " EUR," or " Euros").
Apart from the Language Identification tool, the Text Analysis tools currently work with English text only.
The Text Analysis Tools are designed in such a way that the output from one tool can be used as input to another. This allows you to create powerful toolsets to satisfy your requirements.
Text Search Engine
The Text Search Engine is an advanced search engine that is able to perform in-depth document analysis during indexing. It allows for sophisticated query enhancement and result preparation in order to supply high-quality information retrieval. The most important components are client/server handling, linguistic support for different languages, and document analysis algorithms. In addition, the Text Search Engine features an online update mechanism that allows you to search while the index is being updated. As soon as the update is complete, the newly indexed documents are available for search.
The Text Search Engine provides two user exits that enable you to:
- Access document repositories or library systems to get documents for indexing or preprocessing before indexing. This allows you to integrate the Text Search Engine with any document management system. The product supports various document source formats including plain text and text with HTML markup.
- Convert your specific input formats not explicitly supported, by means of conversion tools or filtering tools that can be obtained from third parties.
The Text Search Engine supports the new Euro code pages and features a broad range of functions accessible through published programming interfaces. Administration can also be performed through command line functions.
The functions for application programming depend, however, strongly on the natural language they support. Therefore, the functions described below are categorized into the three language groups: single-byte character set (SBCS), bi-directional character set (BIDI), and double-byte character set (DBCS) languages.
Functions Supported for SBCS Languages: For the following 19 single-byte character set languages -- Brazilian Portuguese, Canadian French, Catalan, Danish, Dutch, Finnish, French, German, Icelandic, Italian, Norwegian Bokmal, Norwegian Nynorsk, Portuguese, Russian, Spanish, Swedish, Swiss German, U.K. English, and U.S. English -- the Text Search Engine features:
- Multilingual morphological analysis and lemmatization.
- Advanced relevance ranking.
- Boolean queries allowing for phrase and proximity searches as well as for front-, middle-, and end-masking using wildcards, and nested sub-queries.
- Free-text queries based on probabilistic logic.
- IBM's advanced hybrid query, that enables mixing Boolean terms with free-text queries.
- Sophisticated lexical affinities-based ranking for free-text and hybrid queries.
- Fuzzy searches using an n-gram index.
- Thesaurus support providing query expansion through a given thesaurus as well as construction of a user-defined thesaurus.
- Section support for query restriction to certain sections of a document, such as a title or author field.
- Match information sufficient to develop viewers with highlighting capabilities
- Java sample graphical user interface, an attractive and user-friendly sample graphical user interface (GUI) in English that is written in Java. It runs as an applet on any client machine that hosts a Java-capable browser as described in the Software Requirements section. It allows you easy access to the search engine from any point of the Internet or your intranet. Included in the Java sample GUI is the visualization of clustered result lists. This is shipped as compiled Java. All other components of the sample GUI, available in the form of source files and their makefiles, offer most of the Text Search Engine capabilities based on its C-language application programming interface (API) and can easily be modified for your special needs.
- Java Beans samples based on the state-of-the-art Java Beans component model architecture, are offered as two adaptable sample GUIs in English for:
The samples, provided as source files, show how to develop reusable components for search and administrative tasks in a flexible way and how to combine them into attractive and useful GUIs.
- Search functions
- Administrative functions
The sample administration GUI can be used as a standalone Java application on a TCP/IP-connected workstation. You can use the administration GUI to perform some of the Text Search Engine administration functions, such as creating an index, and monitoring and changing the status of an index.
In addition, the following features are available for English documents:
- Clustering of the result list, which eases the user's comprehension of search results.
- Query refinement methods based on user-assigned relevance.
- Feature index. Recognized features like proper names, locations, or terms may be used to obtain higher precision of queries. This index allows a search for documents about, for example, President George Washington. The result will not include documents that contain information about the city of Washington, DC.
Functions Supported for BIDI Languages: For the following two BIDI languages -- Arabic and Hebrew -- the Text Search Engine supports:
- Multilingual morphological analysis and lemmatization
- Advanced relevance ranking
- Boolean queries
- Free-text queries based on probabilistic logic
- Hybrid queries
The document source format must be in logical format.
Functions Supported for DBCS Languages: For the following four double-byte character set languages -- Japanese, Korean, Simplified Chinese, and Traditional Chinese -- the Text Search Engine supports through an n-gram index:
- Boolean queries
- Precise term search
- Fuzzy search
- Thesaurus support providing query expansion through a given thesaurus as well as construction of a user-defined thesaurus
- Match information
Scope of Text Search Engine Usage: Beyond developing enterprise applications, the Text Search Engine can also be used to build a global Internet search service or a centralized intranet search service in support of your e-business initiative. The Text Search Engine provides functions to optimally handle the large amounts of information that are typically stored on Web sites.
The Text Search Engine offers the choice of the different base index types: precise, linguistic, and n-gram. Each type differs with respect to:
- Indexing speed
- Size of index produced
- Complexity of the queries the end user can perform
- Target languages the documents are written in
The trade-offs should be considered during application development.
The Text Analysis Tools and the Text Search Engine can interoperate in various aspects by means of application development to satisfy end-user requirements in the knowledge management area.
Web Crawler Package: A Web crawler is a robot that starts at one or more Web sites and follows selected HTML links you must define in a customization step prior to execution. In addition to defining the domain you want to crawl, including types and number of levels of HTML links, you must specify additional parameters, such as selection criteria for objects to be found on the Web and the directory in a file system where you want the crawler store the retrieved objects. The Web crawler can retrieve objects of any content type and language, such as HTML, text, images, audio, or video, and will store them to the defined directory for further processing. For example, an indexer can use HTML and other text documents to build an index of the documents. After processing, it is the user's responsibility to delete the retrieved objects.
The Web Crawler toolkit provided with Intelligent Miner for Text allows you to develop Web crawlers according to your needs.
A ready-to-run implementation, simply called IBM Web Crawler, is included with the product. The Web Crawler:
- Can run on a single machine and can spawn off a user-specified number of crawler copies that run in parallel.
- Allows individual crawl results, consisting of data objects and their metadata, to be shared for subsequent processing. The data objects are stored as flat files; whereas the metadata, for example, consisting of URL, size, and last modification date of each data object, and crawler-specific control data is stored in DB2.
- Allows for controlled restart due to the persistent and save storage of the metadata in DB2.
- Provides socks support and, as such, is able to crawl the Web from inside a firewall.
- Provides a UNIX command line interface.
- Monitors Web-page activities and changes.
NetQuestion Solution: The NetQuestion Solution is a powerful Internet/intranet text-search solution based on the Text Search Engine and Web Crawler. Although it is a drop-in solution, it allows you enough flexibility to meet specific needs. It provides easy-to-use installation and configuration, and selection of objects found on the Web, and extension of the search scope of the OS/ NetQuestion Solution from a local domain to many servers across a Web space.
After installation, a simple configuration step must be completed, and the solution is ready to run. You specify the Web space -- the portion of the Internet or intranet you wish to search. The IBM Web Crawler gathers the pages that will be searched, and the IBM Text Search Engine is able to search the pages after indexing them.
Part of the NetQuestion Solution is a search form and an associated CGI script which allow you to easily define your queries through a Web browser. The search form and the CGI script are also provided in sample C and HTML source code to allow you to modify the input definition of your queries and the presentation of the results. You can exploit the full functionality of the Text Search Engine by implementing all functions of the published API. In addition, you can perform most of the search engine's administration functions also through forms and CGI scripts and a Web browser.
In some circumstances, the NetQuestion Solution even allows you to detect misspellings in documents and expand your search request accordingly. For example, if one of the occurrences of "Toyota" is misspelled as "Toyotta" in a document and someone later tries to search for "Toyota," the solution automatically adds "Toyotta" to the query.
The NetQuestion Solution and associated components provide key technologies to build intelligent Internet or intranet Web sites. They allow you to leverage the use of the Internet and intranets to gain access to relevant information and support your e-business initiative. For those users wishing to tailor this solution, a full range of settings can be configured.
This product is Year ready. When used in accordance with its associated documentation, it is capable of correctly processing, providing, and/or receiving date data within and between the twentieth and twenty-first centuries, provided that all products (for example, hardware, software, and firmware) used with the product properly exchange accurate date data with it.
The service end date for this Year ready product is January 31,
With Intelligent Miner for Text, IBM extends your technology assets based on the full range of business intelligence solutions available to you, including DB2, Intelligent Miner for Data, and KnowledgeX.
Similar to data mining, text mining discovers patterns in document collections and other unstructured information. The Text Search Engine is not a typical search tool. Together with the Text Analysis Tools it performs categorization and clustering that are necessary when processing large volumes of information.
Complemented with the Web Crawler, you can develop comprehensive solutions to support your e-business initiative.
For more information about the product, refer to the Web page at:
HARDWARE AND SOFTWARE SUPPORT SERVICES
SmoothStart (TM)/Installation Services
IBM SmoothStart Services, an on-site implementation and training startup service designed to accelerate your productive use of your IBM solution, is provided by IBM Global Business Intelligence Solutions (GBIS) and selected IBM Business Partners at an additional cost. For additional information on IBM SmoothStart Services, refer to Services Announcement dated March 25, , or contact your IBM representative and ask for SmoothStart Services for Intelligent Miner for Text for OS/
IBM Installation Services are provided for Intelligent Miner for Text for OS/ by GBIS and selected IBM Business Partners at an additional cost. For additional information, contact your IBM representative and ask for Installation Services for Intelligent Miner for Text for OS/
Questions regarding IBM SmoothStart or Installation Services can also be sent to the following e-mail address:
- Software Announcement dated December 8, (IBM Intelligent Miner for Text Version 2 Release 3 Supports Sun Solaris)
- Software Announcement dated September 22, (IBM KnowledgeX for Workgroup Edition Version )
- Software Announcement dated September 22, (IBM Intelligent Miner for Data for OS/ Version )
Trademarks Intelligent Miner and SmoothStart are trademarks of International Business Machines Corporation in the United States or other countries or both. OS/, AIX, and DB2 are registered trademarks of International Business Machines Corporation in the United States or other countries or both. Windows NT is a registered trademark of Microsoft Corporation. Java is a trademark of Sun Microsystems, Inc. UNIX is a registered trademark in the United States and other countries exclusively through X/Open Company Limited. Other company, product, and service names may be trademarks or service marks of others.
BASE TEXT SEARCH FUNCTIONS PROVIDED WITH OS/ (R)
The Description section identifies the full range of functions as found in the OS/ operating system and in Intelligent Miner (TM) for Text for OS/ Below is the set of text search functions provided with the OS/ operating system.
- IBM Text Search Engine
- Full-text search functions using a:
- Precise index
- Linguistic index
- N-gram index
- Search support for 19 SBCS, two BIDI, and four DBCS languages
- Boolean queries
- Free-text queries
- Fuzzy searches
- Thesaurus support (not for n-gram indexes)
- Relevance ranking
- IBM NetQuestion Solution for a single Web server
- Full-text search service using the IBM Text Search Engine
- Ready-to-run for documents stored on the OS/ domain using the Web server provided by the OS/ operating system
- Accessible through TCP/IP-connected workstations providing an HTML browser
An Intelligent Miner for Text Application Programming Workshop, class number DW56, provided by IBM Education and Training, will be available after planned general availability.
Visit the following Web site for additional information:
Descriptions of all classroom and self-study courses are contained in the Catalog of IBM Education and Training.
Call IBM Education and Training at IBM-TEACH () for catalogs, schedules, and enrollments.
You can find live solution demonstrations on the Web at:
Specified Operating Environment
Hardware Requirements: Intelligent Miner for Text for OS/ V runs on System/ (R) processors supported by OS/ Version , or higher.
The Text Search Engine and its Java (TM) sample GUI have been designed to operate as clients on TCP/IP-connected workstations. The following defines the required hardware:
- Clients on AIX (R) workstations
Any processor of the RS/ (TM) System's family. A minimum of 64 MB random access memory (RAM) is required.
- Clients on Sun Solaris workstations
Any processor of in the Sun SPARC System's family. A minimum of 64 MB RAM is required.
- Clients on Windows NT (R) workstations
Any Pentium (TM)-based processor with a minimum of 64 MB RAM.
Additional RAM may be needed based on the size of data stored in memory and execution speed required.
The following minimum disk space is required and valid for all supported workstation platforms:
+ | | Intelligent Miner for Text for OS/ | | | | |Disk |Text |Text |TSE |Web |Net- |Online | |Space |Analysis|Search|Java |Crawler|question |Docu- | |in MB |Tools |Engine|sample|Package|Solution |ment- | | | |(TSE) |GUI | | |ation | |++++++| |OS/ | 40 | 10 | 10 | 10 | 5 | 30 | | | | | | | | | |Work- | | | | | | | |station| -- | 55 | 10 | -- | -- | | '++++++'
Additional disk space needed depends on the amount of data processed per run and developed Intelligent Miner for Text applications.
Software Requirements: The following software is required to run Intelligent Miner for Text for OS/ V on your system:
- IBM OS/ V, or higher, product number A01
All releases require the base Text Search Engine (TSE) and the NetQuestion Solution for a single Web server (NQS) installed before installing Intelligent Miner for Text for OS/ You can find download information for TSE and NQS at:
- OS/ UNIX (R) System Services included with the OS/ base.
- TCP/IP UNIX Services included with the OS/ base.
The communication between parts of Intelligent Miner for Text for OS/ including the communication between OS/ and connected workstations is based on TCP/IP.
- C/C++ compiler included with the OS/ base in order to compile samples provided with the TSE Java sample GUI.
- Language Environment included with the OS/ base.
- Java for OS/ (A46; JDK V) with PTF UW, or higher for the TSE Java sample GUI.
- For the TSE Java sample GUI and the NetQuestion Solution the Web server (HTTP server) provided by the respective release of the OS/ base. For example, the WebSphere Application Server for OS/ V
- DB2 (R) V, or higher for the Web Crawler package.
- The online documentation is provided in PDF and HTML format for workstations where Adobe Acrobat Reader V or higher, or any Web browser supporting HTML V or higher, is needed. Refer to the Displayable Softcopy Publications section for more detail.
At the present time, you can download the Adobe Acrobat Reader, presently at no charge, from the Web at:
Text Search Engine Client/Server Combinations: Applications using the TSE API invoke the TSE client. The TSE client must be installed on OS/ and can be installed on one of the client workstations described below. The TSE clients for the workstations are provided with the Intelligent Miner for Text V Trial Version shipped with the product. ++ | Software | TSE | TSE Java | | | client | sample GUI | |++| | AIX V | x | x | | A C/C++ compiler, such as | x (1)| | | IBM CSet++ V | x (1)| | | JDK V (3) | | x (1) | | Java Runtime Environment | | x (2) | | (JRE, included in JDK) | | x (2) | | Any Java Vcapable HTML | | x | | browser (3) like | | x | | HotJava (TM) V (3), (4) | | | | A visual builder for Java Beans,| | x (1) | | such as IBM VisualAge (R) | | | | for Java or BDK from Sun | | | '++' ++ | Software |TSE |TSE Java | | |client |sample GUI| |++| | Sun Solaris V | x | x | | A C/C++ compiler, such as | x (1)| | | Sun's Workshop C/C++ | x (1)| | | Compiler V | | | | JDK V including native | | x (1) | | Thread Patch (3) | | x (1) | | Java Runtime Environment | | x (2) | | (JRE, included in JDK) | | x (2) | | An HTML browser, such as | | x | | Netscape Navigator V4 (3) | | | | with the Java Plug-in V (3),| | | | or any Java Vcapable | | | | browswer (3) like | | | | HotJava V (3), (4) | | | | A visual builder for Java Beans, | | x (1) | | such as Java Studio, or | | | | Sun's Beans Development Kit | | | '++' ++ | Software |TSE |TSE Java | | |client |sample GUI| |++| | Windows NT V plus | x | x | | Service Pack 3 | x | x | | A C/C++ compiler, such as | x (1)| | | Microsoft (TM) Visual C++ V | | | | JDK V (3) | | x (1) | | Java Runtime Environment | | x (2) | | (JRE, included in JDK) | | x (2) | | An HTML browser, such as | | x | | Microsoft Internet Explorer V (3)| | | | or Netscape Navigator V4 (3) | | | | both with the | | | | Java Plug-in V (3), or | | | | or any Java Vcapable | | | | browser (3) with | | | | HotJava V (3), (4) | | | | A visual builder for Java Beans, | | x (1) | | such as IBM VisualAge for Java | | | '++'
(1) Optional, only required for application development (2) Optional, only required to run standalone Java applications (3) Or higher (4) At the present time, you can download HotJava, presently at no charge, from the Web at:
NetQuestion Solution Client/Server Combinations: The NetQuestion Solution on OS/ requires an HTML browser, such as:
- Netscape Navigator V4 for AIX
- Netscape Navigator V4 for Sun Solaris/SPARC
- Microsoft Internet Explorer V4 for Windows NT V
- Netscape Navigator V4 for Windows NT
running on any possible TCP/IP-connected workstation.
Installability: The installation of Intelligent Miner for Text for OS/ complements the existing functions Text Search Engine and NetQuestion Solution provided with the OS/ operating system.
Packaging: The Intelligent Miner for Text for OS/ V program package contains:
- Memo to customers
- License Programming Specifications
- Program Directory
- Intelligent Miner for Text for OS/ on either 9/ magnetic tape, IBM cartridge, or 4-mm DAT cartridge
- Getting Started hardcopy documentation
- Intelligent Miner for Text V Trial Version on CD-ROM Media Pack containing the online product documentation. Refer to the Displayable Softcopy Publications section for more detail.
Note: Most of the Intelligent Miner for Text code will be delivered as object code only. Header source files and libraries required for programming against the API, and sample command files are also included in the media. In addition, Java sample source code and sample Java Beans for developing GUIs are provided.
Security, Auditability, and Control
The announced program uses the security and auditability features of the OS/ operating system, the network systems used, and DB2. The customer is responsible for evaluation, selection, and implementation of security features, administrative procedures, and appropriate controls in application systems and communication facilities.
Orders for new licenses will be accepted now.
Shipment will begin on the planned availability date.
New users of Intelligent Miner for Text for OS/ V should specify:
Basic License: To order a basic license, specify the program number and feature number for asset registration. For an Entry Support License charge or Parallel Sysplex (R) License charge specify one of the following feature numbers as applicable and corresponding to the capacity of the designated machine. Specify the feature number of the desired distribution medium shown below.
Entry Support License (ESL): To order an ESL license, specify the program number, feature number for asset registration, and the applicable ESL One-Time Charge (OTC) feature number. Also specify the feature number of the desired distribution medium.
ESL OTC Program Feature Number Description Number
MTX Intelligent Miner for Text for OS/ V
Note: ESL machines can be determined by referring to the IBM Entry End User/ Attachment (Z).
Parallel Sysplex License Charge (PSLC) Basic License: To order a basic license, specify the program number and feature number for asset registration. Specify the PSLC Base feature. If applicable, specify the PSLC Level A, PSLC Level B, and PSLC Level C features and quantity.
If there is more than one program copy in a Parallel Sysplex, the charge for all copies is associated to one license by specifying the applicable PSLC feature numbers and quantity represented by the sum of the Service Units in Millions (MSUs) in your Parallel Sysplex. For all other program copies, specify the PSLC No-Charge (NC) Identifier feature on the licenses.
Also, specify the feature number of the desired distribution medium.
PSLC PSLC Basic License Machine Feature MLC Feature MSU Capacity Number Description
1 PSLC Base, 1 MSU 2 PSLC Base, 2 MSUs 3 PSLC Base, 3 MSUs
4 -- 45 PSLC Level A, 1 MSU PSLC Level A, 42 MSUs
46 or more PSLC Level B, 1 MSU PSLC Level B, 10 MSUs PSLC Level B, 50 MSUs
or more PSLC Level C, 1 MSU PSLC Level C, 10 MSUs PSLC Level C, 50 MSUs
NA PSLC N/C ID
Example 1: For a single machine with 11 MSUs, the PSLC features would be -- quantity 1 and -- quantity 8.
Example 2: For two machines in a Parallel Sysplex which have an aggregation of 60 MSUs, the PSLC features would be:
- PSLC chargeable license #1: -- quantity 1, -- quantity 1, -- quantity 5, and -- quantity 1
- PSLC no-charge license #2: -- quantity 1
Single Version Charging: To elect single version charging, the customer must notify and identify to IBM the prior program and replacement program and the designated machine the programs are operating on.
Version-to-Version Upgrade Credit: To upgrade from a prior program acquired for a one-time charge to a replacement program using a version-to-version upgrade credit, the customer must notify and identify to IBM the applicable prior program and replacement program participating in the upgrade credit.
Basic Machine-Readable Material: To order, select the feature number of the desired distribution medium:
Feature Distribution Environment Number Medium
OS/ SMP/E 9/ magnetic tape cartridge 4-mm DAT cartridge
Customization Options: Select the appropriate feature numbers to customize your order to specify the delivery options desired. These features can be specified on the initial or MES orders.
Example: If publications are not desired for the initial order, specify feature number to ship media only. For future updates, specify feature number to ship media updates only. If, in the future, publication updates are required, order an MES to remove feature number ; then, the publications will ship with the next release of the program.
Feature Description Number
Serial Number Only (suppresses shipment of media and documentation)
Ship Media Only (suppresses initial shipment of documentation)
Ship Documentation Only (suppresses initial shipment of media)
Ship Media Updates Only (suppresses update shipment of documentation)
Ship Documentation Only (suppresses update shipment of media)
Suppress Updates (suppresses update shipment of media and documentation)
Local IBM Office Expedite (for IBM use only)
Customer Expedite Process Charge ($30 charge for each product)
Expedite shipments will be processed to receive hour delivery from the time IBM Software Delivery Solutions (SDS) receives the order. SDS will then ship the order via overnight air transportation.
Unlicensed Documentation: A memo, program directory, and one copy of the following publications are supplied automatically with the basic machine-readable material.
Order Title Number
Intelligent Miner for Text Licensed Programming GH Specifications Getting Started* SH
* All available platforms, OS/, AIX, Sun Solaris, and Windows NT, are covered by the same publication.
Additional copies of unlicensed publications will be available for a fee after availability. These copies may be ordered from your IBM representative, through the System Library Subscription Service (SLSS), or by direct order.
The following optional publications will be available after product availability:
Order Title Number
Intelligent Miner for Text* Fact Sheet GC Getting Started SH Text Analysis Tools SH Text Search Engine: Customization and Administration SH Text Search Engine: Programming Interfaces SH NetQuestion Solution SH Web Crawler SH
* All available platforms, OS/, AIX, Sun Solaris, and Windows NT, are covered by the same publications.
Displayable Softcopy Publications: The Intelligent Miner for Text publications are offered in displayable softcopy form. All unlicensed manuals are included except the Licensed Programming Specifications. The displayable manuals are part of the Intelligent Miner for Text Trial Version provided on CD-ROM and shipped with the Intelligent Miner for Text for OS/ package.
Intelligent Miner for Text* Getting Started (US English and additional languages) HTML, PDF Text Analysis Tools HTML, PDF Text Search Engine: Customization and Administration HTML, PDF Text Search Engine: Programming Interfaces HTML, PDF NetQuestion Solution HTML, PDF Web Crawler HTML, PDF
* All available platforms, OS/, AIX, Sun Solaris, and Windows NT, are covered by the same publications.
The Intelligent Miner for Text Fact Sheet can be displayed and printed through the Web with an HTML browser from the following URL:
The displayable manuals can be displayed with an HTML browser or with Adobe Acrobat Reader, respectively. These files can be used to create unmodified printed copies of the manuals. Terms and conditions for use of the machine-readable files are shipped with the files.
Full-text online documentation search is provided for all publications provided in HTML.
Subsequent updates (technical newsletters or revisions between releases) to the publications shipped with the product will be distributed to the user of record for as long as a license for this software remains in effect. A separate publication order or subscription is not needed.
The following unlicensed publications will be available for a fee after availability. These copies may be ordered from your IBM representative, through the System Library Subscription Service (SLSS), or by direct order.
Order Title Number Language
Intelligent Miner for Text* Getting Started S Brazilian-Portuguese SB Arabic SH French SH German SH Italian SH Japanese SA Korean SH Simplified Chinese SH Spanish SC Traditional Chinese
Intelligent Miner for Text V Trial Version: Intelligent Miner for Text can be ordered after planned availability as a full-product, limited-time trial version as one Media Pack containing CD-ROMs for the workstation platforms AIX, Sun Solaris, and Windows NT at no charge. All displayable manuals are included in HTML and PDF format covering all platforms including OS/ To order, contact your IBM representative.
Order Title Number
Intelligent Miner for GK2T Text Version 60 Day Trial License
The Intelligent Miner for Text servers will cease their operation sixty days after installation. In order to maintain an ongoing operation, a full license program package must be purchased and the supplied license key must be installed by executing one of the provided license key programs, either on AIX, Sun Solaris, or on Windows NT.
TERMS AND CONDITIONS
Licensing: IBM Customer Agreement
Variable Charges Apply: No
Indexed Monthly License Charge (IMLC) Applies: No
Location License Applies: No
Use Limitation Applies: No
Educational Allowance Available: Yes, to qualified education customers
Volume Orders: Not applicable
Version-To-Version Upgrade Credits Apply: Yes
Single Replaced Programs Replacement Programs Version Program Program Program Program Charging Number Name Number Name Applies
MTX Intelligent To a follow-on, if any NA Miner for Text for OS/ V (This is the first version release to synchronize with Intelligent Miner for Text V, IMT)
NA = Not Applicable
Warranty Applies: Yes
Licensed Program Materials Availability
- Restricted Materials of IBM: None
- Non-Restricted Source Materials: Some
- Object Code Only (OCO): Some
Publication that identifies OCO components: SH SH SH SH SH
Availability Date: January 29,
Testing Period: Two months (Basic License only)
Program Services: None
Support Line: S/ (R)
CALL NOW TO ORDER
To order, contact the IBM North America Sales Centers, your local IBM representative, or your IBM Business Partner.
IBM North America Sales Centers, our national direct marketing organization, can add your name to the mailing list for catalogs of IBM products.
Phone: IBM-CALL Fax: IBM-FAX Internet: email@example.com Mail: IBM North America Sales Centers Dept. LE P.O. Box Atlanta, GA Reference: LE
To identify your local IBM Business Partner or IBM representative, call IBM-4YOU.
Note: Shipments will begin after the planned availability date.
Intelligent Miner and RS/ are trademarks of International Business Machines Corporation in the United States or other countries or both. OS/, System/, AIX, DB2, VisualAge, Parallel Sysplex, and S/ are registered trademarks of International Business Machines Corporation in the United States or other countries or both. Pentium is a trademark of Intel Corporation. Microsoft is a trademark of Microsoft Corporation. Windows NT is a registered trademark of Microsoft Corporation. Java and HotJava are trademarks of Sun Microsystems, Inc. UNIX is a registered trademark in the United States and other countries exclusively through X/Open Company Limited. Other company, product, and service names may be trademarks or service marks of others.