ABSTRACT: We describe a methodology for processing data retrieval and update queries using a form-based natural language interface. For the purpose of illustration, we use computer integrated manufacturing (CIM) as the application domain. The interface consists of a set of fourth-generation interface tools (SQL forms), a set of form definitions, a lexicon, and a parser. The forms are developed from the functional and data models of the system. A form definition consists of a form name, a form object, a set of form fields, and a set of fragment grammars. A form object is a single or composite entity that uniquely identifies a form. Form fields consist of database fields whose values can be entered by users (user-defined), and others whose values can be derived by the system (system-defined). Fragment grammars are templates that identify the information requested by user queries. The lexicon consists of all words recognized by the system, their grammatical categories, synonyms, and associations (if any) with database objects and forms. The parser scans a natural language query to identify a form in a bottom-up fashion. The information requested by the user query is determined in a top-down manner by matching the fragment grammars associated with a form against the user query. Extragrammatical inputs with limited deviations from the grammar rules are supported. Elliptical queries are supported by deriving the missing information from those specified in previous queries and forms. Combining a natural language processor with SQL forms allows update queries and prevents violation of database integrity constraints, duplication of records, and invalid data entry.
Key words and phrases: database management systems, complementarity, natural language interfaces, query processing