Automatic Text Summarization of Patent Documents
(2020) EITM01 20202
Department of Electrical and Information Technology
- Abstract
- This thesis investigates Automatic Text Summarization and how it can be used to generate summaries of patent documents. The purpose of the generated summary was to convey the patent document’s subject so that the reader could decide whether the patent document was relevant and should be read in full or could be discarded, in order to reduce the time patent attorneys need to spend reading full patent documents in their daily work.
A summarization tool combining extraction-based summarization with a graph-based ranking method was implemented and tested. The tool was used to generate summaries of ten patent descriptions, and each summary was assessed through human evaluation. The method showed very promising results in summarizing patent descriptions, and the results also highlighted some areas of improvement for future work.
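The abstract names the approach only in general terms (extraction-based summarization with graph-based ranking, with PageRank listed among the keywords). As a rough illustration of that family of methods, and not a reconstruction of the thesis's tool, the sketch below scores sentences with a weighted PageRank over a sentence-similarity graph and returns the top-ranked sentences in their original order. The function names, the naive sentence splitter, the bag-of-words cosine similarity, and the parameter values are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Naive splitter (an assumption): break after ., ! or ? plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_sentences(sentences, damping=0.85, iterations=50):
    """Score sentences with a weighted PageRank over a similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [Counter(re.findall(r"[a-z0-9]+", s.lower())) for s in sentences]
    # Edge weight = similarity between two different sentences.
    weights = [
        [cosine(bags[i], bags[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            # Sentences with no outgoing weight contribute nothing, so the
            # scores need not sum to 1; only their relative order is used.
            incoming = sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            new_scores.append((1 - damping) / n + damping * incoming)
        scores = new_scores
    return scores


def summarize(text, max_sentences=3):
    """Return the highest-scoring sentences, kept in document order."""
    sentences = split_sentences(text)
    if len(sentences) <= max_sentences:
        return sentences
    scores = rank_sentences(sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(top[:max_sentences])]
```

Calling `summarize(description_text, max_sentences=5)` on a patent description would return up to five of its sentences verbatim, which is what makes the approach extractive: nothing is rephrased, only selected. A real tool would likely need a more robust sentence splitter and similarity measure than the ones assumed here.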
Please use this URL to cite or link to this publication:
http://lup.lub.lu.se/student-papers/record/9033080
- author
- Gustafsson, Elin LU
- supervisor
- organization
- course
- EITM01 20202
- year
- 2020
- type
- H2 - Master's Degree (Two Years)
- subject
- keywords
- automatic text summarization, patent documents, natural language processing, extraction-based summarization, PageRank, legal tech
- report number
- LU/LTH-EIT 2020-798
- language
- English
- id
- 9033080
- date added to LUP
- 2021-01-13 14:04:00
- date last changed
- 2021-01-13 14:04:00
@misc{9033080,
  abstract = {{This thesis investigates Automatic Text Summarization and how it can be used to generate summaries of patent documents. The purpose of the generated summary was to convey the patent document’s subject so that the reader could decide whether the patent document is relevant and should be read in full or could be discarded. This, in order to reduce the time patent attorneys need to spend reading full patent documents in their daily work. A summarizing tool using Extraction-based summarization, and a Graph-based ranking method was implemented and tested. Summaries were generated from ten patent descriptions using the implemented tool. Each summary was also evaluated using human evaluation. The method showed very promising results in summarizing patent descriptions. In addition, the results highlighted some areas of improvement for future work.}},
  author   = {{Gustafsson, Elin}},
  language = {{eng}},
  note     = {{Student Paper}},
  title    = {{Automatic Text Summarization of Patent Documents}},
  year     = {{2020}},
}