US20160140187A1 - System and method for answering natural language question - Google Patents
System and method for answering natural language question Download PDFInfo
- Publication number
- US20160140187A1 US20160140187A1 US14/737,130 US201514737130A US2016140187A1 US 20160140187 A1 US20160140187 A1 US 20160140187A1 US 201514737130 A US201514737130 A US 201514737130A US 2016140187 A1 US2016140187 A1 US 2016140187A1
- Authority
- US
- United States
- Prior art keywords
- documents
- index
- question
- meanings
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 230000001788 irregular Effects 0.000 claims abstract description 49
- 239000000284 extract Substances 0.000 claims description 12
- 238000010801 machine learning Methods 0.000 claims description 6
- 238000010586 diagram Methods 0.000 description 13
- 230000008901 benefit Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 241000143957 Vanessa atalanta Species 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000001149 cognitive effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- NHDHVHZZCFYRSB-UHFFFAOYSA-N pyriproxyfen Chemical compound C=1C=CC=NC=1OC(C)COC(C=C1)=CC=C1OC1=CC=CC=C1 NHDHVHZZCFYRSB-UHFFFAOYSA-N 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
Images
Classifications
-
- G06F17/30554—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
-
- G06F17/271—
-
- G06F17/2785—
-
- G06F17/30011—
-
- G06F17/30321—
-
- G06F17/30401—
-
- G06F17/30598—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Definitions
- the present invention relates to a system and method for answering a natural language question, and more particularly, to a system and method for answering a natural language question in which sentences or paragraphs of irregular documents are analyzed and the documents are classified and indexed according to meanings and used to provide an answer to a question, so that information retrieval performance can be improved.
- a question-answering system provides a correct answer as a result of a question.
- Most question-answering systems search documents or paragraphs first and extract a correct answer from the searched documents or paragraphs.
- results of linguistic analysis such as morpheme analysis and syntax analysis, are used.
- a previously proposed method of building a question-answering information retrieval engine for a natural language in Korean on the Internet discloses an Internet information retrieval method of showing a user secondary and tertiary re-query text using a database in which user questions in the form of the natural language are accumulated to let the user select a result corresponding to query text.
- “Question-answering system for extracting a correct answer using a syntax structure” discloses a question-answering system which uses a query language extension and correct answer extraction technique centering on a verb included in a question. Conjugation of verbs uses information of a constructed verb syntax dictionary, and a noun semantic dictionary is used to eliminate the vagueness of verbs.
- the present invention relates to a system and method for answering a natural language question, and is directed to providing a system and method for answering a natural language question in which sentences or paragraphs of irregular documents are analyzed and the documents are classified and indexed according to meanings and used to provide an answer to a question, so that information retrieval performance can be improved.
- a system for answering a natural language question including: an index unit configured to analyze text of previously stored irregular documents and classify and index the irregular documents according to meanings of sentences or paragraphs; a database configured to store the irregular documents indexed according to the meanings by the index unit; a retrieval unit configured to extract an index word by semantically analyzing an input question, and search the database for documents related to the extracted index word; and a provision unit configured to generate a correct answer to the question by analyzing the documents searched by the retrieval unit, and provide results of the search and the generated correct answer.
- the database may include a plurality of index databases classified according to indices.
- the retrieval unit may search for documents in an index database corresponding to the index word among the plurality of index databases.
- the retrieval unit may extract a plurality of index words by analyzing the question.
- the retrieval unit may search for documents in respective index databases corresponding to the plurality of index words.
- the index unit may include: a document analyzer configured to analyze the text of the previously stored irregular documents; a semantic classifier configured to receive the text analyzed by the document analyzer and classify the meanings of the received text in units of sentences or paragraphs; and a document indexer configured to index sentences or paragraphs classified by the semantic classifier according to the meanings.
- the document analyzer may perform morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis on the text.
- the semantic classifier may classify the meanings by extracting sentence features and generating patterns or by using a machine learning technique.
- the document indexer may index the sentences or the paragraphs in units of morphemes, entity names, phrases, syntax structures, semantic structures, sentence structures, “subject-verb” structures, “object-verb” structures, and “subject-verb-object” structures.
- the retrieval unit may include: a question input portion configured to receive the question from an outside of the system; a question analyzer configured to analyze the question input through the question input portion; a question classifier configured to receive the question analyzed by the question analyzer, classify a meaning, and extract the index word; and a document search portion configured to search the database for documents related to the index word extracted by the question classifier.
- the provision unit may provide results of the search according to weights previously given to the respective index words.
- a method of answering a natural language question including: analyzing previously stored irregular documents, and classifying and indexing the irregular documents according to meanings of sentences or paragraphs in the irregular documents; transmitting the irregular documents indexed according to the meanings to a database and storing the indexed irregular documents in the database; when a question is input, analyzing the question to extract an index word, and searching the database for documents related to the extracted index word; and analyzing the searched documents to generate a correct answer to the question, and providing results of the search and the generated correct answer.
- the sentences or the paragraphs in the irregular documents may be classified based on structural information or results of understanding a natural language.
- the classifying and the indexing of the irregular documents may include: analyzing text of the previously stored irregular documents; receiving the analyzed text and classifying the meanings of the received text in units of sentences or paragraphs; and indexing the sentences or the paragraphs classified according to the meanings.
- the classifying and the indexing of the irregular documents may include classifying one sentence or paragraph in the documents into two or more meanings.
- the indexing of the sentences or the paragraphs classified according to the meanings may include indexing the sentences or the paragraphs in units of morphemes, entity names, phrases, syntax structures, semantic structures, “subject-verb” structures, “object-verb” structures, and “subject-verb-object” structures.
- the analyzing of the text of the previously stored irregular documents may include performing morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis on the text.
- the searching of the database for documents related to the extracted index word may include searching for documents in an index database corresponding to the index word among a plurality of index databases classified according to indices.
- the classifying of the meanings of the received text in units of sentences or paragraphs may include classifying the meanings by extracting sentence features and generating patterns or by using a machine learning technique.
- the providing of the results of the search and the generated correct answer may include, when the database is searched with a plurality of index words, providing results of the search according to weights previously given to the respective index words.
- FIG. 1 is a block diagram of a system for answering a natural language question according to an exemplary embodiment of the present invention
- FIG. 2 is detailed block diagram of an index unit shown in FIG. 1 ;
- FIGS. 3A to 3C is a diagram showing an example of results of analyzing text of a document by a document analyzer shown in FIG. 2 ;
- FIG. 4 is a diagram showing an example of paragraphs into which a semantic classifier shown in FIG. 2 according to an exemplary embodiment of the present invention structurally divides a news document;
- FIG. 5 is a diagram showing an example of an index-target sentence indexed according to index units by a document indexer shown in FIG. 2 according to an exemplary embodiment of the present invention
- FIG. 6 is detailed block diagram of a retrieval unit of the system for answering a natural language question shown in FIG. 1 according to an exemplary embodiment of the present invention
- FIG. 7 is a diagram showing an example of a screen provided by a provision unit shown in FIG. 1 ;
- FIG. 8 is an operational flowchart illustrating a method of answering a natural language question according to an exemplary embodiment of the present invention.
- FIG. 1 is a block diagram of a system for answering a natural language question according to an exemplary embodiment of the present invention.
- a system for answering a natural language question may include a storage unit 110 , an index unit 120 , a database 130 , a retrieval unit 140 , and a provision unit 150 .
- the storage unit 110 stores various kinds of irregular data in the form of documents, and the irregular data stored in the storage unit 110 may be acquired via various routes in the World Wide Web.
- the irregular data denotes a large amount of data collected from various channels, such as news, research papers, patents, dictionaries, blogs, online forums, and Facebook.
- the index unit 120 analyzes text of the irregular documents stored in the storage unit 110 , and classifies and indexes the irregular documents according to meanings of sentences or paragraphs. At this time, the index unit 120 transmits the irregular documents indexed according to the meanings to the database 130 .
- the database 130 receives and stores the irregular documents indexed according to the meanings and transmitted from the index unit 120 .
- a plurality of irregular documents it is preferable for a plurality of irregular documents to be grouped according to indices and stored in the database 130 , and it is preferable for the database 130 to be divided so that the grouped irregular documents may be adjacently stored in a predetermined region.
- the database 130 may include a plurality of index databases classified according to indices.
- the retrieval unit 140 extracts an index word by semantically analyzing an input question, and searches the database 130 for documents related to the extracted index word.
- the retrieval unit 140 searches for documents in an index database corresponding to the index word among the plurality of index databases classified according to indices rather than in all regions of the database 130 .
- the retrieval unit 140 may extract a plurality of index words by analyzing the input question. In this case, the retrieval unit 140 searches for documents in respective index databases corresponding to the respective index words.
- the provision unit 150 generates a correct answer to the question by analyzing documents searched by the retrieval unit 140 , and provides the search results and the correct answer.
- FIG. 2 is detailed block diagram of an index unit in the system for answering a natural language question shown in FIG. 1 according to an exemplary embodiment of the present invention.
- the index unit 120 of the system for answering a natural language question analyzes the text of irregular documents stored in the storage unit 110 shown in FIG. 1 , and classifies and indexes the irregular documents according to meanings of sentences or paragraphs.
- the index unit 120 may include a document analyzer 121 , a semantic classifier 123 , and a document indexer 125 .
- the document analyzer 121 analyzes the text of the irregular documents stored in the storage unit 110 .
- FIGS. 3A to 3C is a diagram showing an example of results of analyzing text of a document by the document analyzer 121 .
- the document analyzer 121 performs morpheme analysis, lexical analysis such as recognition of entity names, syntax analysis, and sentence structure analysis to apply various index units.
- Text which is analyzed in depth by the document analyzer 121 is subsequently used in a semantic classification operation as well as a document index operation.
- the semantic classifier 123 receives the text analyzed by the document analyzer 121 and classifies meanings of the received text. At this time, the semantic classifier 123 classifies the received text in units of sentences or paragraphs. In other words, the semantic classifier 123 receives the text from the document analyzer 121 and classifies the text in units of sentences or paragraphs according to meanings.
- FIG. 4 is a diagram showing an example of structurally divided paragraphs of a news document. As shown in FIG. 4 , sentences or paragraphs are divided based on structure information of a document or results of understanding a natural language.
- the semantic classifier 123 may classify the text in various categories according to a system request. For example, the text may be classified by work, evaluation, constitution, reason, effect, character, background of growth, and so on.
- the semantic classifier 123 may classify the text by extracting sentence features and generating patterns or by using a machine learning technique.
- the semantic classifier 123 may classify one sentence or paragraph into two or more meanings.
- the semantic classifier 123 may classify an example sentence “Sunsin Yi was born in Hanseong and passed the military examination in the middle period of the Joseon Dynasty” into two meanings “occupation” and “birth.”
- weights may be given to the respective meanings.
- weights of 0.7 and 0.3 may be given to “occupation” and “birth,” respectively.
- the document indexer 125 indexes documents in units of sentences or paragraphs classified according to meanings by the semantic classifier 123 .
- the document indexer 125 may index the documents in units of morphemes, entity names, phrases, syntax structures, and semantic structures, and may analyze sentence structures and perform indexing in units of 2-tuples (subject-verb and object-verb) and 3-tuples (subject-verb-object).
- FIG. 5 shows an example of an index-target sentence indexed according to index units in which “ ” is used as the index-target sentence.
- the index-target sentence of FIG. 5 includes the syntax structure “Subject( )-Object( ).”
- another sentence may represent the same meaning in another document.
- the other sentence may be “Subject( )-Object( )-Verb( ).”
- indexing is performed up to semantic structure units of sentences, and index databases are generated according to semantic classification of the sentences.
- FIG. 6 is detailed block diagram of the retrieval unit 140 of the system for answering a natural language question according to an exemplary embodiment of the present invention.
- the retrieval unit 140 of the system for answering a natural language question extracts an index word by semantically analyzing an input question, and searches the database 130 for documents having the extracted index word.
- the retrieval unit 140 may include a question input portion 141 , a question analyzer 143 , a question classifier 145 , and a document search portion 147 .
- the question input portion 141 is configured to receive a question from the outside of the system.
- the question input portion 141 may be a keyboard, a touchpad, etc., but is not limited thereto.
- the question analyzer 143 analyzes the question input through the question input portion 141 . At this time, the question analyzer 143 performs morpheme analysis, lexical analysis such as recognition of entity names, syntax analysis, and sentence structure analysis to apply various index units.
- the question classifier 145 receives the question analyzed by the question analyzer 143 , classifies a meaning of the question, and extracts an index word. At this time, the question classifier 145 classifies the received question in units of sentences or paragraphs, and two or more index words may be extracted by the question classifier 145 .
- the document search portion 147 searches the database 130 for documents related to the index word extracted by the question classifier 145 .
- the document search portion 147 searches for documents in an index database corresponding to the index word among a plurality of index databases classified according to indices rather than in all regions of the database 130 .
- the document search portion 147 searches for documents in respective index databases corresponding to the respective index words.
- the provision unit 150 generates a correct answer to the question by analyzing documents searched by the retrieval unit 140 , and provides the search results and the correct answer.
- the provision unit 150 analyzes the number of index words with which documents have been searched, the weight of each index word when documents have been searched with a plurality of index words, a document appropriate for a result of the question, and so on.
- the provision unit 150 determines the weights of the respective index words and provides search results according to the weights.
- search results based on an index word having the small weight may not be provided to increase importance of index words having large weights.
- FIG. 7 is a diagram showing an example of a screen provided by a provision unit according to an exemplary embodiment of the present invention.
- search results and a correct answer of the question “Who is the admiral born in Hanseong and having passed the military examination in the middle period of the Joseon Dynasty?” are provided.
- search results and correct answers of two index words “occupation” and “birth,” which are extracted from the question “Who is the admiral born in Hanseong and having passed the military examination in the middle period of the Joseon Dynasty?,” are provided.
- the provision unit 150 may provide 60% of search results obtained from an occupation index database and 40% of search results obtained from a birth index database as search results.
- the provision unit 150 may not provide search results obtained from the birth index database and may provide only search results obtained from the occupation index database.
- FIG. 8 is an operational flowchart illustrating a method of answering a natural language question according to an exemplary embodiment of the present invention.
- the index unit 120 analyzes the text of irregular documents stored in the storage unit 110 (S 810 ), and classifies and indexes the documents according to meanings of sentences or paragraphs in the documents (S 820 ).
- a process of storing various kinds of irregular documents in the storage unit 110 may be further performed.
- the process of storing irregular documents in the storage unit 110 may include a process of acquiring information from various channels in the World Wide Web.
- morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis may be performed on the documents, and the documents may be classified by extracting sentence features and generating patterns or by using a machine learning technique.
- the sentences or the paragraphs in the documents may be classified based on structural information or results of understanding a natural language.
- one sentence or paragraph may be classified into two or more meanings.
- the documents may be indexed in units of morphemes, entity names, phrases, syntax structures, and semantic structures.
- the documents may also be indexed in units of 2-tuple structures, such as “subject-verb” and “object-verb” and 3-tuple structures, such as “subject-verb-object.”
- the index unit 120 After indexing the irregular documents in operation S 820 , the index unit 120 transmits the documents indexed according to meanings to the database 130 and stores the indexed documents in the database 130 (S 830 ).
- the retrieval unit 140 continuously determines whether a question is input (S 840 ).
- the retrieval unit 140 extracts an index word by analyzing the question (S 850 ), and searches the database 130 for documents related to the extracted index word (S 860 ).
- the retrieval unit 140 searches for documents in an index database corresponding to the index word among the plurality of index databases divided according to indices.
- a plurality of index words may be extracted.
- documents are searched in index databases corresponding to the respective extracted index words.
- the provision unit 150 analyzes the documents searched in operation S 860 and provides the search results and a correct answer of the question (S 870 ).
- semantically classified paragraphs or sentences are used as search targets instead of whole original documents.
- semantically related sentences or paragraphs are searched instead of whole documents, so that users can find desired information with little effort.
- a currently used search service provides a user with all documents which can be searched for using one search question, and thus the user is required to find desired information in the search results.
- it is analyzed what kind of information a user wants to obtain from a question, and only information wanted by the user is provided.
- questions and search targets are classified into semantic paragraphs and then indexed, and a user is provided with a document including a correct answer as well as the correct answer, so that the correct answer can be highly trusted by the user.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
- This application claims priority to and the benefit of Korean Patent Application No. 10-2014-0161904, filed on Nov. 19, 2014, the disclosure of which is incorporated herein by reference in its entirety.
- 1. Field of the Invention
- The present invention relates to a system and method for answering a natural language question, and more particularly, to a system and method for answering a natural language question in which sentences or paragraphs of irregular documents are analyzed and the documents are classified and indexed according to meanings and used to provide an answer to a question, so that information retrieval performance can be improved.
- 2. Discussion of Related Art
- Recently, an information retrieval system for processing information on countless web documents on websites, extracting only information corresponding to a user's request, and providing the extracted information to the user is widely being used.
- However, in general, it is very difficult to accurately extract documents wanted by an information requester from a huge set of web documents and obtain an accurate answer to a specific question.
- For this reason, unlike an existing search system which searches for documents having matched words, a natural language question-answer search system which understands a user's intention to recommend appropriate documents and a correct answer has emerged.
- In general, a question-answering system provides a correct answer as a result of a question. Most question-answering systems search documents or paragraphs first and extract a correct answer from the searched documents or paragraphs. Here, to search documents or paragraphs and extract a correct answer, results of linguistic analysis, such as morpheme analysis and syntax analysis, are used.
- However, there are still many errors in linguistic analysis results, and there is no way other than using such linguistic analysis results to extract a correct answer. Therefore, the overall performance of a question-answering system is rather low.
- A previously proposed method of building a question-answering information retrieval engine for a natural language in Korean on the Internet discloses an Internet information retrieval method of showing a user secondary and tertiary re-query text using a database in which user questions in the form of the natural language are accumulated to let the user select a result corresponding to query text.
- Also, “Question-answering system for extracting a correct answer using a syntax structure (reference literature: Daeyoen Lee and Yeonghun Seo, The 15th Annual Conference on Human and Cognitive Language Technology, pp. 89 to 94, 2003)” discloses a question-answering system which uses a query language extension and correct answer extraction technique centering on a verb included in a question. Conjugation of verbs uses information of a constructed verb syntax dictionary, and a noun semantic dictionary is used to eliminate the vagueness of verbs.
- In a knowledge-based question answering system for acquisition of concept word (reference literature: Jaehong Lee, Hoseop Choi, and Cheolyeong Ock, The 15th Annual Conference on Human and Cognitive Language Technology, pp. 95 to 100, 2003), a statistic-based knowledge base using a hybrid method and a lexicon-classification-based knowledge base are efficiently constructed centering on a Korean dictionary, an encyclopedia, etc. in which knowledge of the real world is systematically defined to some degree, and used.
- Such research for existing Korean question-answering systems has a model for extracting a correct answer using a keyword and syntax structure information. However, due to the low reliability of linguistic analysis results, the overall performance of the question-answering systems is low.
- In addition, according to existing general information search methods, original text having information similar to a question is searched, or results obtained by structurally dividing a document and searching the divided document are provided.
- However, in a natural language question-answering system, unnecessarily provided retrieval results may be misused and cause degradation of the overall performance of the system. This also results from misunderstanding of the point of a question and information requested by the question.
- Therefore, it is necessary to research a method for providing an accurate answer without causing such performance degradation of a question-answering system.
- The present invention relates to a system and method for answering a natural language question, and is directed to providing a system and method for answering a natural language question in which sentences or paragraphs of irregular documents are analyzed and the documents are classified and indexed according to meanings and used to provide an answer to a question, so that information retrieval performance can be improved.
- According to an aspect of the present invention, there is provided a system for answering a natural language question, the system including: an index unit configured to analyze text of previously stored irregular documents and classify and index the irregular documents according to meanings of sentences or paragraphs; a database configured to store the irregular documents indexed according to the meanings by the index unit; a retrieval unit configured to extract an index word by semantically analyzing an input question, and search the database for documents related to the extracted index word; and a provision unit configured to generate a correct answer to the question by analyzing the documents searched by the retrieval unit, and provide results of the search and the generated correct answer.
- The database may include a plurality of index databases classified according to indices.
- The retrieval unit may search for documents in an index database corresponding to the index word among the plurality of index databases.
- The retrieval unit may extract a plurality of index words by analyzing the question.
- The retrieval unit may search for documents in respective index databases corresponding to the plurality of index words.
- The index unit may include: a document analyzer configured to analyze the text of the previously stored irregular documents; a semantic classifier configured to receive the text analyzed by the document analyzer and classify the meanings of the received text in units of sentences or paragraphs; and a document indexer configured to index sentences or paragraphs classified by the semantic classifier according to the meanings.
- The document analyzer may perform morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis on the text.
- The semantic classifier may classify the meanings by extracting sentence features and generating patterns or by using a machine learning technique.
- The document indexer may index the sentences or the paragraphs in units of morphemes, entity names, phrases, syntax structures, semantic structures, sentence structures, “subject-verb” structures, “object-verb” structures, and “subject-verb-object” structures.
- The retrieval unit may include: a question input portion configured to receive the question from an outside of the system; a question analyzer configured to analyze the question input through the question input portion; a question classifier configured to receive the question analyzed by the question analyzer, classify a meaning, and extract the index word; and a document search portion configured to search the database for documents related to the index word extracted by the question classifier.
- When the database is searched with a plurality of index words, the provision unit may provide results of the search according to weights previously given to the respective index words.
- According to another aspect of the present invention, there is provided a method of answering a natural language question, the method including: analyzing previously stored irregular documents, and classifying and indexing the irregular documents according to meanings of sentences or paragraphs in the irregular documents; transmitting the irregular documents indexed according to the meanings to a database and storing the indexed irregular documents in the database; when a question is input, analyzing the question to extract an index word, and searching the database for documents related to the extracted index word; and analyzing the searched documents to generate a correct answer to the question, and providing results of the search and the generated correct answer.
- The sentences or the paragraphs in the irregular documents may be classified based on structural information or results of understanding a natural language.
- The classifying and the indexing of the irregular documents may include: analyzing text of the previously stored irregular documents; receiving the analyzed text and classifying the meanings of the received text in units of sentences or paragraphs; and indexing the sentences or the paragraphs classified according to the meanings.
- The classifying and the indexing of the irregular documents may include classifying one sentence or paragraph in the documents into two or more meanings.
- The indexing of the sentences or the paragraphs classified according to the meanings may include indexing the sentences or the paragraphs in units of morphemes, entity names, phrases, syntax structures, semantic structures, “subject-verb” structures, “object-verb” structures, and “subject-verb-object” structures.
- The analyzing of the text of the previously stored irregular documents may include performing morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis on the text.
- The searching of the database for documents related to the extracted index word may include searching for documents in an index database corresponding to the index word among a plurality of index databases classified according to indices.
- The classifying of the meanings of the received text in units of sentences or paragraphs may include classifying the meanings by extracting sentence features and generating patterns or by using a machine learning technique.
- The providing of the results of the search and the generated correct answer may include, when the database is searched with a plurality of index words, providing results of the search according to weights previously given to the respective index words.
- The above and other objects, features and advantages of the present invention will become more apparent to those of ordinary skill in the art by describing in detail exemplary embodiments thereof with reference to the accompanying drawings, in which:
-
FIG. 1 is a block diagram of a system for answering a natural language question according to an exemplary embodiment of the present invention; -
FIG. 2 is detailed block diagram of an index unit shown inFIG. 1 ; -
FIGS. 3A to 3C is a diagram showing an example of results of analyzing text of a document by a document analyzer shown inFIG. 2 ; -
FIG. 4 is a diagram showing an example of paragraphs into which a semantic classifier shown inFIG. 2 according to an exemplary embodiment of the present invention structurally divides a news document; -
FIG. 5 is a diagram showing an example of an index-target sentence indexed according to index units by a document indexer shown inFIG. 2 according to an exemplary embodiment of the present invention; -
FIG. 6 is detailed block diagram of a retrieval unit of the system for answering a natural language question shown inFIG. 1 according to an exemplary embodiment of the present invention; -
FIG. 7 is a diagram showing an example of a screen provided by a provision unit shown inFIG. 1 ; and -
FIG. 8 is an operational flowchart illustrating a method of answering a natural language question according to an exemplary embodiment of the present invention. - Advantages and features of the present invention and a method of achieving the same will be more clearly understood from embodiments described below in detail with reference to the accompanying drawings. However, the present invention is not limited to the following embodiments and may be implemented in various different forms. The embodiments are provided merely for complete disclosure of the present invention and to fully convey the scope of the invention to those of ordinary skill in the art to which the present invention pertains. The present invention is defined only by the scope of the claims. Throughout the specification, like reference numerals refer to like elements.
- In describing the present invention, any detailed description of related art of the invention will be omitted if it is deemed that such a description will obscure the gist of the invention unintentionally. In addition, terms used below are defined in consideration of functions in the present invention, which may be changed according to the intention of a user or an operator, or a practice, etc. Therefore, the definitions of these terms should be made based on the overall description of this specification.
- Hereinafter, a system and method for answering a natural language question according to exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.
-
FIG. 1 is a block diagram of a system for answering a natural language question according to an exemplary embodiment of the present invention. - Referring to
FIG. 1 , a system for answering a natural language question according to an exemplary embodiment of the present invention may include astorage unit 110, anindex unit 120, adatabase 130, aretrieval unit 140, and aprovision unit 150. - The
storage unit 110 stores various kinds of irregular data in the form of documents, and the irregular data stored in thestorage unit 110 may be acquired via various routes in the World Wide Web. Here, the irregular data denotes a large amount of data collected from various channels, such as news, research papers, patents, dictionaries, blogs, online forums, and Facebook. - The
index unit 120 analyzes text of the irregular documents stored in thestorage unit 110, and classifies and indexes the irregular documents according to meanings of sentences or paragraphs. At this time, theindex unit 120 transmits the irregular documents indexed according to the meanings to thedatabase 130. - The
database 130 receives and stores the irregular documents indexed according to the meanings and transmitted from theindex unit 120. Here, it is preferable for a plurality of irregular documents to be grouped according to indices and stored in thedatabase 130, and it is preferable for thedatabase 130 to be divided so that the grouped irregular documents may be adjacently stored in a predetermined region. - Therefore, the
database 130 may include a plurality of index databases classified according to indices. - The
retrieval unit 140 extracts an index word by semantically analyzing an input question, and searches thedatabase 130 for documents related to the extracted index word. - When searching the
database 130 for documents, theretrieval unit 140 searches for documents in an index database corresponding to the index word among the plurality of index databases classified according to indices rather than in all regions of thedatabase 130. - The
retrieval unit 140 may extract a plurality of index words by analyzing the input question. In this case, theretrieval unit 140 searches for documents in respective index databases corresponding to the respective index words. - The
provision unit 150 generates a correct answer to the question by analyzing documents searched by theretrieval unit 140, and provides the search results and the correct answer. -
FIG. 2 is detailed block diagram of an index unit in the system for answering a natural language question shown inFIG. 1 according to an exemplary embodiment of the present invention. - Referring to
FIG. 2 , theindex unit 120 of the system for answering a natural language question according to an exemplary embodiment of the present invention analyzes the text of irregular documents stored in thestorage unit 110 shown inFIG. 1 , and classifies and indexes the irregular documents according to meanings of sentences or paragraphs. - As shown in
FIG. 2 , theindex unit 120 may include adocument analyzer 121, asemantic classifier 123, and adocument indexer 125. - The
document analyzer 121 analyzes the text of the irregular documents stored in thestorage unit 110. -
FIGS. 3A to 3C is a diagram showing an example of results of analyzing text of a document by thedocument analyzer 121. As shown inFIGS. 3A to 3C , thedocument analyzer 121 performs morpheme analysis, lexical analysis such as recognition of entity names, syntax analysis, and sentence structure analysis to apply various index units. - Text which is analyzed in depth by the
document analyzer 121 is subsequently used in a semantic classification operation as well as a document index operation. - The
semantic classifier 123 receives the text analyzed by thedocument analyzer 121 and classifies meanings of the received text. At this time, thesemantic classifier 123 classifies the received text in units of sentences or paragraphs. In other words, thesemantic classifier 123 receives the text from thedocument analyzer 121 and classifies the text in units of sentences or paragraphs according to meanings. - Meanwhile,
FIG. 4 is a diagram showing an example of structurally divided paragraphs of a news document. As shown inFIG. 4 , sentences or paragraphs are divided based on structure information of a document or results of understanding a natural language. - The
semantic classifier 123 may classify the text in various categories according to a system request. For example, the text may be classified by work, evaluation, constitution, reason, effect, character, background of growth, and so on. - Also, the
semantic classifier 123 may classify the text by extracting sentence features and generating patterns or by using a machine learning technique. - In addition, the
semantic classifier 123 may classify one sentence or paragraph into two or more meanings. - For example, the
semantic classifier 123 may classify an example sentence “Sunsin Yi was born in Hanseong and passed the military examination in the middle period of the Joseon Dynasty” into two meanings “occupation” and “birth.” - In this way, when the
semantic classifier 123 classifies a classification target into two or more meanings, weights may be given to the respective meanings. - In other words, when the
semantic classifier 123 classifies the example sentence into the two meanings “occupation” and “birth,” weights of 0.7 and 0.3 may be given to “occupation” and “birth,” respectively. - When a classification target is classified into two or more meanings, it is possible to increase the accuracy rate of a search result by giving weights to the respective meanings.
- The
document indexer 125 indexes documents in units of sentences or paragraphs classified according to meanings by thesemantic classifier 123. - At this time, the
document indexer 125 may index the documents in units of morphemes, entity names, phrases, syntax structures, and semantic structures, and may analyze sentence structures and perform indexing in units of 2-tuples (subject-verb and object-verb) and 3-tuples (subject-verb-object). -
-
-
- Since the two sentences have the same meaning but different ways of expression, they are recognized and indexed as sentences having different meanings in the related art.
- To solve this problem, in an exemplary embodiment of the present invention, indexing is performed up to semantic structure units of sentences, and index databases are generated according to semantic classification of the sentences.
-
FIG. 6 is detailed block diagram of theretrieval unit 140 of the system for answering a natural language question according to an exemplary embodiment of the present invention. - Referring to
FIG. 6 , theretrieval unit 140 of the system for answering a natural language question according to an exemplary embodiment of the present invention extracts an index word by semantically analyzing an input question, and searches thedatabase 130 for documents having the extracted index word. - Here, the
retrieval unit 140 may include aquestion input portion 141, aquestion analyzer 143, aquestion classifier 145, and adocument search portion 147. - The
question input portion 141 is configured to receive a question from the outside of the system. For example, thequestion input portion 141 may be a keyboard, a touchpad, etc., but is not limited thereto. - The
question analyzer 143 analyzes the question input through thequestion input portion 141. At this time, thequestion analyzer 143 performs morpheme analysis, lexical analysis such as recognition of entity names, syntax analysis, and sentence structure analysis to apply various index units. - The
question classifier 145 receives the question analyzed by thequestion analyzer 143, classifies a meaning of the question, and extracts an index word. At this time, thequestion classifier 145 classifies the received question in units of sentences or paragraphs, and two or more index words may be extracted by thequestion classifier 145. - The
document search portion 147 searches thedatabase 130 for documents related to the index word extracted by thequestion classifier 145. - When searching the
database 130 for documents, thedocument search portion 147 searches for documents in an index database corresponding to the index word among a plurality of index databases classified according to indices rather than in all regions of thedatabase 130. - When a plurality of index words are extracted by the
question classifier 145, thedocument search portion 147 searches for documents in respective index databases corresponding to the respective index words. - The
provision unit 150 generates a correct answer to the question by analyzing documents searched by theretrieval unit 140, and provides the search results and the correct answer. - At this time, the
provision unit 150 analyzes the number of index words with which documents have been searched, the weight of each index word when documents have been searched with a plurality of index words, a document appropriate for a result of the question, and so on. - When documents are searched with a plurality of index words, the
provision unit 150 determines the weights of the respective index words and provides search results according to the weights. - Also, when documents are searched with a plurality of index words and the weights of the respective index words are determined, if any one weight is too small (e.g., 0.1), search results based on an index word having the small weight may not be provided to increase importance of index words having large weights.
-
FIG. 7 is a diagram showing an example of a screen provided by a provision unit according to an exemplary embodiment of the present invention. - Referring to
FIG. 7 , search results and a correct answer of the question “Who is the admiral born in Hanseong and having passed the military examination in the middle period of the Joseon Dynasty?” are provided. At this time, search results and correct answers of two index words “occupation” and “birth,” which are extracted from the question “Who is the admiral born in Hanseong and having passed the military examination in the middle period of the Joseon Dynasty?,” are provided. - Here, when the weight of the index word “occupation” is 0.6 and the weight of the index word “birth” is 0.4, the
provision unit 150 may provide 60% of search results obtained from an occupation index database and 40% of search results obtained from a birth index database as search results. - Also, when the weight of the index word “occupation” is 0.9 and the weight of the index word “birth” is 0.1, the
provision unit 150 may not provide search results obtained from the birth index database and may provide only search results obtained from the occupation index database. - Thus far, a detailed configuration and functions of the system for answering a natural language question according to an exemplary embodiment of the present invention have been described. A method of answering a natural language question according to an exemplary embodiment of the present invention will be described in stages below.
-
FIG. 8 is an operational flowchart illustrating a method of answering a natural language question according to an exemplary embodiment of the present invention. - Referring to
FIGS. 1 and 8 , theindex unit 120 analyzes the text of irregular documents stored in the storage unit 110 (S810), and classifies and indexes the documents according to meanings of sentences or paragraphs in the documents (S820). - Meanwhile, although not shown in the drawing, a process of storing various kinds of irregular documents in the
storage unit 110 may be further performed. The process of storing irregular documents in thestorage unit 110 may include a process of acquiring information from various channels in the World Wide Web. - In operation S810, morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis may be performed on the documents, and the documents may be classified by extracting sentence features and generating patterns or by using a machine learning technique.
- In operation S820, the sentences or the paragraphs in the documents may be classified based on structural information or results of understanding a natural language.
- Also, in operation S820, one sentence or paragraph may be classified into two or more meanings.
- Meanwhile, in operation S820, the documents may be indexed in units of morphemes, entity names, phrases, syntax structures, and semantic structures. In addition, the documents may also be indexed in units of 2-tuple structures, such as “subject-verb” and “object-verb” and 3-tuple structures, such as “subject-verb-object.”
- After indexing the irregular documents in operation S820, the
index unit 120 transmits the documents indexed according to meanings to thedatabase 130 and stores the indexed documents in the database 130 (S830). - When the documents are stored in the
database 130 in operation S830, it is preferable to divide thedatabase 130 into a plurality of index databases and store the documents indexed according to meanings in the corresponding index databases. - When the documents to be searched to answer a question are stored in the
database 130 in operation S830, theretrieval unit 140 continuously determines whether a question is input (S840). - When it is determined in operation S840 that a question is input (S840-Yes), the
retrieval unit 140 extracts an index word by analyzing the question (S850), and searches thedatabase 130 for documents related to the extracted index word (S860). - In operation S850, morpheme analysis, lexical analysis, syntax analysis, and sentence structure analysis are performed on the question.
- In operation S860, the
retrieval unit 140 searches for documents in an index database corresponding to the index word among the plurality of index databases divided according to indices. - Meanwhile, in operation S850, a plurality of index words may be extracted. In this case, in operation S860, documents are searched in index databases corresponding to the respective extracted index words.
- When documents are searched in operation S860, the
provision unit 150 analyzes the documents searched in operation S860 and provides the search results and a correct answer of the question (S870). - According to exemplary embodiments of the present invention, when documents are searched and indexed, semantically classified paragraphs or sentences are used as search targets instead of whole original documents. When these semantically classified paragraphs or sentences are used, semantically related sentences or paragraphs are searched instead of whole documents, so that users can find desired information with little effort.
- In addition, a currently used search service provides a user with all documents which can be searched for using one search question, and thus the user is required to find desired information in the search results. However, in exemplary embodiments of the present invention, it is analyzed what kind of information a user wants to obtain from a question, and only information wanted by the user is provided.
- Further, unlike related art, questions and search targets are classified into semantic paragraphs and then indexed, and a user is provided with a document including a correct answer as well as the correct answer, so that the correct answer can be highly trusted by the user.
- It will be apparent to those skilled in the art that various modifications can be made to the above-described exemplary embodiments of the present invention without departing from the spirit or scope of the invention. Thus, it is intended that the present invention covers all such modifications provided they come within the scope of the appended claims and their equivalents.
Claims (20)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020140161904A KR102094934B1 (en) | 2014-11-19 | 2014-11-19 | Natural Language Question-Answering System and method |
KR10-2014-0161904 | 2014-11-19 |
Publications (2)
Publication Number | Publication Date |
---|---|
US20160140187A1 true US20160140187A1 (en) | 2016-05-19 |
US10503828B2 US10503828B2 (en) | 2019-12-10 |
Family
ID=55961880
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/737,130 Active 2036-08-25 US10503828B2 (en) | 2014-11-19 | 2015-06-11 | System and method for answering natural language question |
Country Status (2)
Country | Link |
---|---|
US (1) | US10503828B2 (en) |
KR (1) | KR102094934B1 (en) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170262429A1 (en) * | 2016-03-12 | 2017-09-14 | International Business Machines Corporation | Collecting Training Data using Anomaly Detection |
US10339168B2 (en) * | 2016-09-09 | 2019-07-02 | International Business Machines Corporation | System and method for generating full questions from natural language queries |
US10339167B2 (en) * | 2016-09-09 | 2019-07-02 | International Business Machines Corporation | System and method for generating full questions from natural language queries |
CN110019644A (en) * | 2017-09-06 | 2019-07-16 | 腾讯科技(深圳)有限公司 | Searching method, device and computer readable storage medium in dialogue realization |
CN110597952A (en) * | 2019-08-20 | 2019-12-20 | 深圳壹账通智能科技有限公司 | Information processing method, server, and computer storage medium |
CN111538805A (en) * | 2020-05-25 | 2020-08-14 | 武汉烽火普天信息技术有限公司 | A method and system for text information extraction based on deep learning and rule engine |
CN111858878A (en) * | 2020-06-18 | 2020-10-30 | 达而观信息科技(上海)有限公司 | Method, system and storage medium for automatically extracting answer from natural language text |
WO2020251780A1 (en) * | 2019-06-12 | 2020-12-17 | Elsevier, Inc. | Systems and methods for federated search with dynamic selection and distributed relevance |
CN112219198A (en) * | 2018-07-06 | 2021-01-12 | Je国际公司 | Search device and program |
US10929612B2 (en) | 2018-04-24 | 2021-02-23 | Electronics And Telecommunications Research Institute | Neural network memory computing system and method |
CN112417155A (en) * | 2020-11-27 | 2021-02-26 | 浙江大学 | Court trial query generation method, device and medium based on pointer-generation Seq2Seq model |
CN113139048A (en) * | 2021-04-19 | 2021-07-20 | 中国人民解放军91054部队 | Retrieval result providing method and system |
US11144833B2 (en) | 2016-11-23 | 2021-10-12 | Electronics And Telecommunications Research Institute | Data processing apparatus and method for merging and processing deterministic knowledge and non-deterministic knowledge |
CN113779203A (en) * | 2020-06-09 | 2021-12-10 | 北京金山数字娱乐科技有限公司 | Method and device for generating paragraph set and inference method and device |
CN113836283A (en) * | 2021-09-24 | 2021-12-24 | 上海金仕达软件科技有限公司 | Answer generation method and device, electronic equipment and storage medium |
CN113849601A (en) * | 2021-09-17 | 2021-12-28 | 上海数熙传媒科技有限公司 | An Input Pruning Acceleration Method for Question Answering Task Models |
WO2022227165A1 (en) * | 2021-04-28 | 2022-11-03 | 平安科技(深圳)有限公司 | Question and answer method and apparatus for machine reading comprehension, computer device, and storage medium |
CN115828893A (en) * | 2022-11-28 | 2023-03-21 | 北京海致星图科技有限公司 | Method, device, storage medium and equipment for question answering of unstructured document |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6929539B2 (en) * | 2016-10-07 | 2021-09-01 | 国立研究開発法人情報通信研究機構 | Non-factoid question answering system and method and computer program for it |
KR102028155B1 (en) * | 2017-03-06 | 2019-10-08 | 주식회사 수브이 | Document scoring method and document searching system |
KR102075833B1 (en) * | 2018-03-15 | 2020-02-11 | 주식회사 노다멘 | Curation method and system for recommending of art contents |
KR102259390B1 (en) * | 2019-09-09 | 2021-06-02 | 주식회사 솔트룩스 | System and method for ensemble question-answering |
JP7342971B2 (en) * | 2019-12-12 | 2023-09-12 | 日本電信電話株式会社 | Dialogue processing device, learning device, dialogue processing method, learning method and program |
US20210230981A1 (en) * | 2020-01-28 | 2021-07-29 | Schlumberger Technology Corporation | Oilfield data file classification and information processing systems |
KR102351388B1 (en) | 2020-05-06 | 2022-01-14 | 송형석 | System for providing question and automatical answer service of homepage |
KR102584452B1 (en) | 2020-10-07 | 2023-10-05 | 한국전자통신연구원 | Apparatus and method for automatic generation of machine reading comprehension data |
WO2022102827A1 (en) * | 2020-11-16 | 2022-05-19 | 주식회사 솔트룩스 | Knowledge extraction system for scientific and technical papers |
KR102339794B1 (en) * | 2020-12-04 | 2021-12-16 | 주식회사 애자일소다 | Apparatus and method for servicing question and answer |
KR20230056278A (en) | 2021-10-20 | 2023-04-27 | 주식회사 엘지유플러스 | Method and apparatus for natural language processing |
KR20230059601A (en) * | 2021-10-26 | 2023-05-03 | (주) 글루시스 | Structure of structured database and method for using unstructured data in Internet of Things environment |
KR20240042933A (en) | 2022-09-26 | 2024-04-02 | 주식회사 엘지유플러스 | Method and apparatus for natural language processing |
KR102753339B1 (en) * | 2022-11-29 | 2025-01-14 | 한국전자통신연구원 | Method and apparatus for searching groupware bulletin board |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5704060A (en) * | 1995-05-22 | 1997-12-30 | Del Monte; Michael G. | Text storage and retrieval system and method |
US6078913A (en) * | 1997-02-12 | 2000-06-20 | Kokusai Denshin Denwa Co., Ltd. | Document retrieval apparatus |
US20030101182A1 (en) * | 2001-07-18 | 2003-05-29 | Omri Govrin | Method and system for smart search engine and other applications |
US20050086222A1 (en) * | 2003-10-16 | 2005-04-21 | Wang Ji H. | Semi-automatic construction method for knowledge base of encyclopedia question answering system |
US20070073533A1 (en) * | 2005-09-23 | 2007-03-29 | Fuji Xerox Co., Ltd. | Systems and methods for structural indexing of natural language text |
US20070094006A1 (en) * | 2005-10-24 | 2007-04-26 | James Todhunter | System and method for cross-language knowledge searching |
US20070112555A1 (en) * | 2001-03-13 | 2007-05-17 | Ofer Lavi | Dynamic Natural Language Understanding |
US20090292687A1 (en) * | 2008-05-23 | 2009-11-26 | International Business Machines Corporation | System and method for providing question and answers with deferred type evaluation |
US20100057708A1 (en) * | 2008-09-03 | 2010-03-04 | William Henry Billingsley | Method and System for Computer-Based Assessment Including a Search and Select Process |
US20100153094A1 (en) * | 2008-12-11 | 2010-06-17 | Electronics And Telecommunications Research Institute | Topic map based indexing and searching apparatus |
US20110246505A1 (en) * | 2008-07-11 | 2011-10-06 | Jong Sun Jung | File generation and search methods for data search, and database management system for data file search |
US20120229872A1 (en) * | 2009-11-10 | 2012-09-13 | Au10Tix Limited | Apparatus and methods for computerized authentication of electronic documents |
US20130007033A1 (en) * | 2008-05-14 | 2013-01-03 | International Business Machines Corporation | System and method for providing answers to questions |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20010004404A (en) | 1999-06-28 | 2001-01-15 | 정선종 | Keyfact-based text retrieval system, keyfact-based text index method, and retrieval method using this system |
KR20020045343A (en) | 2000-12-08 | 2002-06-19 | 오길록 | Method of information generation and retrieval system based on a standardized Representation format of sentences structures and meanings |
KR100498574B1 (en) * | 2001-03-08 | 2005-07-01 | 주식회사 다이퀘스트 | Real-time Natural Language Question-Answering System Using Unit Paragraph Indexing Method |
KR100546743B1 (en) * | 2003-10-02 | 2006-01-26 | 한국전자통신연구원 | Automatic Question / Answer Indexing Method based on Linguistic Analysis and its Q & A Method |
KR100597435B1 (en) * | 2004-12-07 | 2006-07-10 | 한국전자통신연구원 | Hybrid-based Question Classification System and Method in Information Retrieval and Question Answer System |
KR100745367B1 (en) | 2004-12-14 | 2007-08-02 | 한국전자통신연구원 | Method of index and retrieval of record based on template and question answering system using as the same |
KR100599450B1 (en) * | 2004-12-21 | 2006-07-12 | 한국전자통신연구원 | Answer Index System and Method in Question and Answer System |
KR100756921B1 (en) | 2006-02-28 | 2007-09-07 | 한국과학기술원 | A computer-readable recording medium containing a document classification method and a program for executing the document classification method on a computer. |
KR101042515B1 (en) * | 2008-12-11 | 2011-06-17 | 주식회사 네오패드 | Information retrieval method and information provision method based on user's intention |
KR101667232B1 (en) | 2010-04-12 | 2016-10-19 | 삼성전자주식회사 | Semantic based searching apparatus and semantic based searching method and server for providing semantic based metadata and method for operating thereof |
JP5825676B2 (en) * | 2012-02-23 | 2015-12-02 | 国立研究開発法人情報通信研究機構 | Non-factoid question answering system and computer program |
-
2014
- 2014-11-19 KR KR1020140161904A patent/KR102094934B1/en active Active
-
2015
- 2015-06-11 US US14/737,130 patent/US10503828B2/en active Active
Patent Citations (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5704060A (en) * | 1995-05-22 | 1997-12-30 | Del Monte; Michael G. | Text storage and retrieval system and method |
US6078913A (en) * | 1997-02-12 | 2000-06-20 | Kokusai Denshin Denwa Co., Ltd. | Document retrieval apparatus |
US20070112555A1 (en) * | 2001-03-13 | 2007-05-17 | Ofer Lavi | Dynamic Natural Language Understanding |
US20030101182A1 (en) * | 2001-07-18 | 2003-05-29 | Omri Govrin | Method and system for smart search engine and other applications |
US20050086222A1 (en) * | 2003-10-16 | 2005-04-21 | Wang Ji H. | Semi-automatic construction method for knowledge base of encyclopedia question answering system |
US7428487B2 (en) * | 2003-10-16 | 2008-09-23 | Electronics And Telecommunications Research Institute | Semi-automatic construction method for knowledge base of encyclopedia question answering system |
US20070073533A1 (en) * | 2005-09-23 | 2007-03-29 | Fuji Xerox Co., Ltd. | Systems and methods for structural indexing of natural language text |
US7672831B2 (en) * | 2005-10-24 | 2010-03-02 | Invention Machine Corporation | System and method for cross-language knowledge searching |
US20070094006A1 (en) * | 2005-10-24 | 2007-04-26 | James Todhunter | System and method for cross-language knowledge searching |
US20130007033A1 (en) * | 2008-05-14 | 2013-01-03 | International Business Machines Corporation | System and method for providing answers to questions |
US8768925B2 (en) * | 2008-05-14 | 2014-07-01 | International Business Machines Corporation | System and method for providing answers to questions |
US20140258286A1 (en) * | 2008-05-14 | 2014-09-11 | International Business Machines Corporation | System and method for providing answers to questions |
US8332394B2 (en) * | 2008-05-23 | 2012-12-11 | International Business Machines Corporation | System and method for providing question and answers with deferred type evaluation |
US20090292687A1 (en) * | 2008-05-23 | 2009-11-26 | International Business Machines Corporation | System and method for providing question and answers with deferred type evaluation |
US20110246505A1 (en) * | 2008-07-11 | 2011-10-06 | Jong Sun Jung | File generation and search methods for data search, and database management system for data file search |
US20130275462A1 (en) * | 2008-07-11 | 2013-10-17 | Jong Sun Jung | File creating method for searching of data, searching method of data file and managing system for searching of data file |
US20100057708A1 (en) * | 2008-09-03 | 2010-03-04 | William Henry Billingsley | Method and System for Computer-Based Assessment Including a Search and Select Process |
US20100153094A1 (en) * | 2008-12-11 | 2010-06-17 | Electronics And Telecommunications Research Institute | Topic map based indexing and searching apparatus |
US8554540B2 (en) * | 2008-12-11 | 2013-10-08 | Electronics And Telecommunication Research Institute | Topic map based indexing and searching apparatus |
US20120229872A1 (en) * | 2009-11-10 | 2012-09-13 | Au10Tix Limited | Apparatus and methods for computerized authentication of electronic documents |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10078632B2 (en) * | 2016-03-12 | 2018-09-18 | International Business Machines Corporation | Collecting training data using anomaly detection |
US20170262429A1 (en) * | 2016-03-12 | 2017-09-14 | International Business Machines Corporation | Collecting Training Data using Anomaly Detection |
US10339168B2 (en) * | 2016-09-09 | 2019-07-02 | International Business Machines Corporation | System and method for generating full questions from natural language queries |
US10339167B2 (en) * | 2016-09-09 | 2019-07-02 | International Business Machines Corporation | System and method for generating full questions from natural language queries |
US11144833B2 (en) | 2016-11-23 | 2021-10-12 | Electronics And Telecommunications Research Institute | Data processing apparatus and method for merging and processing deterministic knowledge and non-deterministic knowledge |
CN110019644A (en) * | 2017-09-06 | 2019-07-16 | 腾讯科技(深圳)有限公司 | Searching method, device and computer readable storage medium in dialogue realization |
US10929612B2 (en) | 2018-04-24 | 2021-02-23 | Electronics And Telecommunications Research Institute | Neural network memory computing system and method |
CN112219198A (en) * | 2018-07-06 | 2021-01-12 | Je国际公司 | Search device and program |
US11366814B2 (en) | 2019-06-12 | 2022-06-21 | Elsevier, Inc. | Systems and methods for federated search with dynamic selection and distributed relevance |
WO2020251780A1 (en) * | 2019-06-12 | 2020-12-17 | Elsevier, Inc. | Systems and methods for federated search with dynamic selection and distributed relevance |
CN110597952A (en) * | 2019-08-20 | 2019-12-20 | 深圳壹账通智能科技有限公司 | Information processing method, server, and computer storage medium |
CN111538805A (en) * | 2020-05-25 | 2020-08-14 | 武汉烽火普天信息技术有限公司 | A method and system for text information extraction based on deep learning and rule engine |
CN113779203A (en) * | 2020-06-09 | 2021-12-10 | 北京金山数字娱乐科技有限公司 | Method and device for generating paragraph set and inference method and device |
CN111858878A (en) * | 2020-06-18 | 2020-10-30 | 达而观信息科技(上海)有限公司 | Method, system and storage medium for automatically extracting answer from natural language text |
CN112417155A (en) * | 2020-11-27 | 2021-02-26 | 浙江大学 | Court trial query generation method, device and medium based on pointer-generation Seq2Seq model |
CN113139048A (en) * | 2021-04-19 | 2021-07-20 | 中国人民解放军91054部队 | Retrieval result providing method and system |
WO2022227165A1 (en) * | 2021-04-28 | 2022-11-03 | 平安科技(深圳)有限公司 | Question and answer method and apparatus for machine reading comprehension, computer device, and storage medium |
CN113849601A (en) * | 2021-09-17 | 2021-12-28 | 上海数熙传媒科技有限公司 | An Input Pruning Acceleration Method for Question Answering Task Models |
CN113836283A (en) * | 2021-09-24 | 2021-12-24 | 上海金仕达软件科技有限公司 | Answer generation method and device, electronic equipment and storage medium |
CN115828893A (en) * | 2022-11-28 | 2023-03-21 | 北京海致星图科技有限公司 | Method, device, storage medium and equipment for question answering of unstructured document |
Also Published As
Publication number | Publication date |
---|---|
US10503828B2 (en) | 2019-12-10 |
KR102094934B1 (en) | 2020-03-31 |
KR20160060253A (en) | 2016-05-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10503828B2 (en) | System and method for answering natural language question | |
Alzubi et al. | COBERT: COVID-19 question answering system using BERT | |
Gupta et al. | A survey of text question answering techniques | |
CN107436864B (en) | Chinese question-answer semantic similarity calculation method based on Word2Vec | |
US9213771B2 (en) | Question answering framework | |
US8341112B2 (en) | Annotation by search | |
US20190065576A1 (en) | Single-entity-single-relation question answering systems, and methods | |
US20170330084A1 (en) | Clarification of Submitted Questions in a Question and Answer System | |
Zhu et al. | Twitter sentiment analysis of covid vaccines | |
WO2013125286A1 (en) | Non-factoid question answering system and computer program | |
JP5710581B2 (en) | Question answering apparatus, method, and program | |
CN103229223A (en) | Providing answers to questions using multiple models to score candidate answers | |
JP2011118689A (en) | Retrieval method and system | |
CN112380848A (en) | Text generation method, device, equipment and storage medium | |
Damiano et al. | Towards a framework for closed-domain question answering in Italian | |
Lipczak et al. | Tulip: Lightweight entity recognition and disambiguation using wikipedia-based topic centroids | |
Sasikumar et al. | A survey of natural language question answering system | |
Derici et al. | A closed-domain question answering framework using reliable resources to assist students | |
Basirat et al. | Lexical and morpho-syntactic features in word embeddings-a case study of nouns in swedish | |
Pilaluisa et al. | Contextual word embeddings for tabular data search and integration | |
Paramita et al. | Methods for collection and evaluation of comparable documents | |
Gupta et al. | Hybrid approach for punjabi question answering system | |
Hamoud et al. | Using an Islamic Question and Answer Knowledge Base to answer questions about the holy Quran | |
Mozafari et al. | Attention-based pairwise multi-perspective convolutional neural network for answer selection in question answering | |
Chala et al. | A Framework for Enriching Job Vacancies and Job Descriptions Through Bidirectional Matching. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTIT Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BAE, YONG JIN;KIM, HYUN KI;RYU, PUM MO;AND OTHERS;REEL/FRAME:035825/0233 Effective date: 20150513 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE AFTER FINAL ACTION FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: ADVISORY ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: EX PARTE QUAYLE ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO EX PARTE QUAYLE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2551); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 4 |