학술논문

Phishing Website Detection Using Machine Learning
Document Type
Conference
Source
2023 1st International Conference on Circuits, Power and Intelligent Systems (CCPIS) Circuits, Power and Intelligent Systems (CCPIS), 2023 1st International Conference on. :1-6 Sep, 2023
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Fields, Waves and Electromagnetics
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Protocols
Machine learning algorithms
Phishing
Web pages
Machine learning
Passwords
Natural language processing
Personal Information
Machine Learning
Malicious Links
Phishing Domain Characteristics
Language
Abstract
Phishing is a common tactic employed to deceive unsuspecting individuals into revealing their personal information through fraudulent websites. These phishing websites often steal passwords, usernames, and sensitive data related to online financial transactions. Phishers create websites that mimic the appearance and language of legitimate web pages to trick their targets. To combat the evolving techniques used in phishing attacks, it is crucial to implement antiphishing measures that can detect such attempts. Machine learning can play a significant role in stopping phishing attacks. This proposal presents a compilation of features and machine learningbased algorithms for identifying phishing attempts. Attackers frequently resort to phishing because it is easier to trick victims into clicking on seemingly legitimate yet malicious links, compared to circumventing computer security protocols. By replicating the official website's logo and other elements, the malicious link in the message body gives the impression of being affiliated with the legitimate website. This proposal explores the distinguishing characteristics of phishing domains, also known as scam domains, and emphasizes the importance of identifying them. Additionally, it discusses machine learning and natural language processing techniques that can be employed to achieve this goal.