TY - GEN
T1 - GReAT
T2 - 15th International Conference on Enterprise Information Systems, ICEIS 2013
AU - Puyana, Claudia Gomez
AU - Quimbaya, Alexandra Pomares
PY - 2013
Y1 - 2013
N2 - The excessive amount of available narrative texts within diverse domains such as health (e.g. medical records), justice (e.g. laws, declarations), assurance (e.g. declarations), etc. increases the required time for the analysis of information in a decision making process. Different approaches of summary generation of these texts have been proposed to solve this problem. However, some of them do not take into account the sequentiality of the original document, which reduces the quality of the final summary, other ones create overall summaries that do not satisfy the end user who requires a summary that is related to his profile (e.g. different medical specializations require different information) and others do not analyze the potential duplication of information and the noise of natural language on the summary. To cope these problems this paper presents GReAT a model for automatic summarization that relies on natural language processing and text mining techniques to extract the most relevant information from narrative texts focused on the requirements of the end user. GReAT is an extraction based summary generation model which principle is to identify the user's relevant information filtering the text by topic and frequency of words, also it reduces the number of phrases of the summary avoiding the duplication of information. Experimental results show that the functionality of GReAT improves the quality of the summary over other existing methods.
AB - The excessive amount of available narrative texts within diverse domains such as health (e.g. medical records), justice (e.g. laws, declarations), assurance (e.g. declarations), etc. increases the required time for the analysis of information in a decision making process. Different approaches of summary generation of these texts have been proposed to solve this problem. However, some of them do not take into account the sequentiality of the original document, which reduces the quality of the final summary, other ones create overall summaries that do not satisfy the end user who requires a summary that is related to his profile (e.g. different medical specializations require different information) and others do not analyze the potential duplication of information and the noise of natural language on the summary. To cope these problems this paper presents GReAT a model for automatic summarization that relies on natural language processing and text mining techniques to extract the most relevant information from narrative texts focused on the requirements of the end user. GReAT is an extraction based summary generation model which principle is to identify the user's relevant information filtering the text by topic and frequency of words, also it reduces the number of phrases of the summary avoiding the duplication of information. Experimental results show that the functionality of GReAT improves the quality of the summary over other existing methods.
KW - Natural language processing
KW - Summary generation
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=84887724371&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84887724371
SN - 9789898565594
T3 - ICEIS 2013 - Proceedings of the 15th International Conference on Enterprise Information Systems
SP - 280
EP - 288
BT - ICEIS 2013 - Proceedings of the 15th International Conference on Enterprise Information Systems
Y2 - 4 July 2013 through 7 July 2013
ER -