학술논문

Engineering Web resource summaries using Pointwise mutual information (PMI)-based for web document summarization
Document Type
Conference
Source
2022 International Conference on Innovations in Science and Technology for Sustainable Development (ICISTSD) Innovations in Science and Technology for Sustainable Development (ICISTSD), 2022 International Conference on. :215-221 Aug, 2022
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Technological innovation
Oceans
Machine learning
Internet
Reliability
Data mining
Sustainable development
Document Summarization
Text Summarization
Point-Wise Mutual Information (PMI)
Total Point-Wise Mutual Information (TPMI)
Sentence Repository (SR)
Language
Abstract
Internet is an ocean of information which is growing everyday. It is impossible for any human being to read the whole information. It is very difficult to decide which document is relevant. If someone gets summary of the document that is the easiest way to decide. Therefore, there is a need to create automatic text summarization systems which generate a precise summary. In this paper, a study of summaries generated using PMI based web document summarization technique, which creates summary on ranking of sentences on the basis of Total PointWise Mutual Information of sentences. The data is gathered basically from documents of engineering domain, for text summarization. The results show that the method outperforms the other techniques by exhibiting the best results for the closest mean score and generating a good quality summary of sentences of different lengths. It also depicts that the proposed approach has minimum error compared to other machine learning approaches.