Phishing website datasets on Kaggle.

Feb 7, 2024 · A URL dataset with more than 800,000 URLs, where 52% of the domains are legitimate and the remaining 47% are phishing domains.

Jun 19, 2025 · The PhishOFE Dataset - A Phishing URL Dataset is a comprehensive dataset designed for phishing URL detection using machine learning techniques.

The dataset has been taken from Kaggle and is available there as Phishing Site URLs Dataset. It consists of two columns: 'URL', containing the website URLs, and 'Label', containing the corresponding labels ('phishing' or 'legitimate'). It provides insights into key characteristics that distinguish phishing websites from legitimate ones.

Identify Phishing using Machine learning Algorithms. A useful dataset for analyzing and detecting phishing websites, consisting of numerous phishing websites.

Comprehensive phishing detection datasets collection. The dataset contains a total of 18 features that are used to detect phishing websites.

The dataset contains 101,083 URLs, with labeled features extracted from both the URL structure and the HTML content of the webpages. It is a collection of data samples from various sources: the URLs were collected from the JPCERT website, existing Kaggle datasets, GitHub repositories where the URLs are updated once a year, and some open source databases.

Unmasking Cyber Threats: Investigating a Dataset of Phishing Websites. The dataset used in this notebook contains URLs along with their corresponding labels (phishing or legitimate).
Detect Phishing in Web Pages.
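To make the two-column layout ('URL', 'Label') and the idea of features "extracted from the URL structure" more concrete, here is a minimal Python sketch. The sample rows are made up for illustration, and the feature names (`url_length`, `num_dots`, `has_at_symbol`, `uses_https`, `host_is_ipv4`) are assumptions chosen for this example; they are not the feature set of any dataset listed above.

```python
from collections import Counter
from urllib.parse import urlparse

# Made-up rows in the two-column ('URL', 'Label') layout; not real dataset entries.
SAMPLE_ROWS = [
    {"URL": "https://www.example.com/index.html", "Label": "legitimate"},
    {"URL": "http://192.0.2.10/secure-login/update.php?id=1", "Label": "phishing"},
    {"URL": "http://login.example.test/verify@account", "Label": "phishing"},
]


def url_features(url):
    """Return a few simple lexical features of a URL (hypothetical feature set)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return {
        "url_length": len(url),
        "num_dots": host.count("."),
        "has_at_symbol": "@" in url,
        "uses_https": parsed.scheme == "https",
        # Rough check for a dotted-quad host such as 192.0.2.10.
        "host_is_ipv4": host.replace(".", "").isdigit() and host.count(".") == 3,
    }


def label_shares(rows):
    """Return the fraction of rows carrying each label."""
    counts = Counter(row["Label"] for row in rows)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


if __name__ == "__main__":
    for row in SAMPLE_ROWS:
        print(row["Label"], url_features(row["URL"]))
    print(label_shares(SAMPLE_ROWS))
```

With a real CSV, the same two functions could be applied to rows read via `csv.DictReader`, assuming the file's column names match 'URL' and 'Label'. Checking the label shares first is a quick way to see whether a dataset is roughly balanced before training a classifier on it.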