hacking-tutorial/python-scripts/https_data_fetcher_bettercap.md at main · AnasheGas/hacking-tutorial

https_data_fetcher_bettercap.py

Usage Scenario

This script is a crucial all-in-one analysis tool for penetration testers, security auditors, and system administrators. It focuses on combining a comprehensive Passive Web Security Scan (HTTP headers, SSL/TLS, forms, technology stack) with an Active Network Traffic Capture using the external utility Bettercap.

What problem does it solve? It addresses the problem of fragmented security analysis by generating a single, structured report that correlates web application vulnerabilities (e.g., missing security headers, insecure cookies, weak SSL) with on-the-wire network intelligence (DNS queries, TCP connections). This holistic view provides crucial context often missed by static scanners.
Who would benefit from using it? Ethical Hackers/Penetration Testers benefit by automating the initial reconnaissance and vulnerability discovery phase. Security Engineers use it to continuously monitor and assess the security posture of their web applications and underlying network infrastructure, receiving a prioritized Vulnerability Score (0-100) and Risk Level (LOW to CRITICAL).

What the Script Does

The Python script acts as a multi-stage security assessment engine that integrates web analysis with network sniffing via Bettercap.

The execution flow is:

Input: It requires a mandatory Target HTTPS URL (https://...) and a Network Interface (e.g., eth0, wlan0) for Bettercap.
Process - Stage 1 (Passive Web Analysis): It fetches the URL, parses the HTML with BeautifulSoup, analyzes all linked resources, and performs a deep dive into the server's security posture. This includes checks on SSL/TLS configuration, HTTP security headers (HSTS, CSP), and cookie security attributes (Secure, HttpOnly).
Process - Stage 2 (Active Network Capture): It spawns the external Bettercap process on the specified interface, actively capturing network traffic (DNS, TCP, TLS handshakes) while simultaneously generating controlled test traffic to the target. This gathers network-level intelligence.
Process - Stage 3 (Analysis and Reporting): It aggregates all web data and network logs, calculates a Vulnerability Score based on proprietary rules, assigns a Risk Level, and compiles a final, structured report with prioritized recommendations.

Output/Side Effects: The script prints real-time logs and generates a timestamped, comprehensive report file (.json, .csv, or .html) containing all extracted data, scores, and recommendations. Note: Requires root/sudo access due to the use of Bettercap for packet capture.

Code Explanation

Imported Libraries

Library	Purpose
`subprocess`, `sys`, `os`, `time`, `threading`	Core utilities for managing the script's execution, handling system commands, and concurrent operations (crucial for running Bettercap simultaneously with web fetching).
`requests`, `urllib3`	Essential for making HTTP/HTTPS requests to the target URL and managing connection warnings (e.g., suppressing warnings for skipped SSL verification).
`beautifulsoup4` (`bs4`)	Used for parsing the HTML content of the target page, enabling extraction of links, forms, scripts, and meta tags for deeper analysis.
`json`, `csv`, `re`, `socket`, `ssl`, `hashlib`, `datetime`, `urllib.parse`	Standard libraries for data handling, regular expressions, network socket programming (used in `get_ssl_info`), and date/URL manipulation.
`cryptography`	Used for advanced analysis of SSL/TLS certificates, including chain validation and public key size assessment.
`geoip2`	Utilized for IP geolocation analysis of network addresses discovered during the Bettercap capture.

Main Functions

Function	Parameters	Returns	Purpose
`fetch_https_data`	`url`, `verify_ssl`	`dict`	Performs the passive web scan: fetches content, parses HTML, extracts all links/forms, detects technologies, and initiates security sub-analyses.
`get_ssl_info`	`hostname`, `port`	`dict`	Connects directly to the host to perform a deep SSL/TLS certificate analysis, checking for weak versions, key sizes, and expiry dates.
`analyze_cookies`	`headers`	`dict`	Parses `Set-Cookie` headers to check for security flags (`Secure`, `HttpOnly`, `SameSite`) and assigns severity to missing attributes.
`analyze_csp`	`headers`	`dict`	Parses and evaluates the Content Security Policy header, identifying the presence of dangerous directives like `unsafe-inline` or `unsafe-eval`.
`start_bettercap`	`iface`	`subprocess.Popen`	Spawns the Bettercap process with necessary modules activated (e.g., `http.proxy.sslstrip true`, `net.sniff.local true`) for network capture.
`capture_traffic`	`proc`, `target_url`, `duration`, `deep_scan`	`dict`	Reads and parses Bettercap's real-time output for the specified duration, extracting and structuring network intelligence (DNS, TCP, IP geolocation).
`calculate_vulnerability_score`	`analysis_data`	`dict`	Applies a rule-based scoring system (deducting points based on issue severity) to assign a final score (0-100) and a Risk Level (LOW, MEDIUM, HIGH, CRITICAL).
`analyze_and_report`	`url_data`, `network_data`	`dict`	The orchestrator function that merges all findings, calculates the score, and compiles the final, structured report with recommendations.

Execution Logic

The script follows a sequential, four-step flow managed by the main function (if __name__ == "__main__":):

Initialization and Arguments: The script first validates all command-line arguments (URL, interface) and parses optional flags like --no-verify and --timeout.
HTTPS Web Scan (Passive): It calls fetch_https_data(). If the data retrieval is successful, it proceeds to perform detailed analyses on headers, cookies, CSP, and SSL/TLS configuration using the respective sub-functions.
Bettercap Capture (Active): It attempts to start the Bettercap process via start_bettercap(). If successful, it enters the capture_traffic() phase, where it runs concurrently with the web analysis to gather network logs for the specified duration. The Bettercap process is explicitly terminated before proceeding.
Reporting and Output: The collected url_data and network_data are passed to analyze_and_report(). This function generates the final vulnerability score and risk assessment. A summary is printed to the console (print_summary), and the full, detailed report is saved to a file using save_report() in the user-specified format (JSON, CSV, or HTML).

Screenshots

Sequence Diagram

sequenceDiagram
    participant User
    participant Main as https_data_fetcher_bettercap.py (main)
    participant RequestsLib as requests/bs4/cryptography
    participant Bettercap as bettercap Process
    participant OS as Operating System
    participant TargetServer as Target HTTPS Server
    participant FileSystem as Report Filesystem

    User->>Main: Execute (sudo python script.py <url> <iface>)
    Main->>Main: Parse arguments, Start scan timer

    activate Main
    
    Note over Main: [STEP 1] HTTPS analysis...

    Note over Main,TargetServer: --- Phase 1: Web Analysis Start ---
    Main->>RequestsLib: fetch_https_data(url, verify_ssl)
    RequestsLib->>TargetServer: HTTP/S GET Request
    TargetServer-->>RequestsLib: HTTP/S Response
    RequestsLib->>RequestsLib: Parse HTML & detect_technologies
    RequestsLib->>TargetServer: get_ssl_info() (TLS Handshake)
    TargetServer-->>RequestsLib: SSL Certificate Details
    RequestsLib->>Main: Return url_data
    Note over Main,TargetServer: --- Phase 1: Web Analysis End ---
    
    Note over Main: [STEP 2] Network analysis...

    Main->>OS: start_bettercap(interface)
    OS->>Bettercap: Start bettercap Process
    Bettercap-->>Main: Return bettercap_proc

    alt Bettercap Started
    
        Note over Main: [STEP 3] Traffic capture...

        Note over Main,TargetServer: --- Phase 2: Network Capture Start ---
        Main->>Main: capture_traffic(proc, url, timeout)
        Main->>Bettercap: Send commands (net.sniff on)
        Main->>TargetServer: Generate controlled traffic
        TargetServer-->>Bettercap: Network Traffic Stream
        Bettercap->>Main: stdout Stream (Parse events)
        Main->>Bettercap: Send commands (net.sniff off)
        Bettercap-->>Main: Return network_data
        Note over Main,TargetServer: --- Phase 2: Network Capture End ---
        
        Main->>Bettercap: proc.terminate()
        Bettercap->>OS: Terminate Process
        
    else Bettercap Failed
        Main->>Main: Skip network capture
    end
    
    Note over Main: [STEP 4] Analysis and reporting...

    Main->>Main: analyze_and_report(url_data, network_data)
    Main->>Main: calculate_vulnerability_score()
    
    Main->>User: print_summary(report)
    
    Main->>FileSystem: save_report(format, dir)
    FileSystem-->>Main: Report file saved
    
    Main->>User: Print final confirmation

    deactivate Main

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

https_data_fetcher_bettercap.py

Usage Scenario

What the Script Does

Code Explanation

Imported Libraries

Main Functions

Execution Logic

Screenshots

Sequence Diagram

FilesExpand file tree

https_data_fetcher_bettercap.md

Latest commit

History

https_data_fetcher_bettercap.md

File metadata and controls

https_data_fetcher_bettercap.py

Usage Scenario

What the Script Does

Code Explanation

Imported Libraries

Main Functions

Execution Logic

Screenshots

Sequence Diagram