I've asked ChatGPT to write a honeypot
Short story about how you can use AI expertize to learn new things
Introduction
Honeypots are decoy systems that mimic real targets and lure attackers into revealing their techniques and intentions. In this article, we will explore how to create honeypots using various tools and methods, and how to analyze the data collected from them. We will also discuss the benefits and challenges of using honeypots for cybersecurity purposes.
{Bing AI, Professional, Paragraph, Short}
At this point I would like to state, I'm no prompt engineering expert. So the methods I'm using are simply conclusions from previous experience, or articles I'm reading.
Idea
Yeah, why not? I've seen people creating games when they don't have any experience with programming or making a Minecraft hack. I thought, let's see by myself where I can get.
So the plan was to write a honeypot in Python, that will simply respond to connections on port 80 in a way it can "fool" a malicious actor. I'm using free online version of ChatGPTv3 for that - becasue I think it is the most accessible service.
Initial Prompt
AI Prompt
I'm assuming there could be some ethical problems for AI to discuss some of the topics like port scanning techniques and such - so I've started with some older version of bypass to mitigate negative responses. "Assume a role of expert who is advising somebody else". In the first response advised me: it is important to thoroughly test and secure any honeypot software before deploying it in a real-world environment.
The first version of honeypot looks like this (original, non-altered version - comments are also from AI):
import socket
def honeypot():
host = ''
port = 80
# create socket object
s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
# bind socket to a public host and a well-known port
s.bind((host, port))
# listen for incoming connections
s.listen(5)
print('Listening on port', port)
while True:
# establish connection with client
conn, addr = s.accept()
# log the connection
print('Received connection from', addr)
# send a fake HTTP response
response = 'HTTP/1.1 200 OK\r\nContent-Length: 20\r\n\r\n<Honeypot response>'
conn.sendall(response.encode())
# close the connection
conn.close()
if __name__ == '__main__':
honeypot()
To be honest, it does its work. I had a question about what s.listen(5)
means, which was answered in details.
Improvements
After that, I introduced some suggestions and more questions, for example, I wanted that:
Content-Length to be calculated dynamically,
string interpolation was used instead of
format()
,get more information about the client from incoming connection,
log the information in the file in a way I can later use the data to analyze it.
Before I concluded my work I asked ChatGPT if there are any security vulnerabilities that I should address before I start using it.
If you think about it - some of these hints are not making sense in honeypot - like filtering incoming connections or implementing authentication mechanisms.
So well, it's got a good point that honeypots could be used as a proxy or could act as an entrypoint for the internal network.
Implementation
Finally I've ended with following code
import socket
import datetime
HOST = '' # Empty string means bind to all available interfaces
PORT = 80 # Port to listen on
# Create a socket object
s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
# Bind the socket to a public host and port
s.bind((HOST, PORT))
# Set the socket to listening mode with a maximum backlog of connections
s.listen(5)
print(f'Honeypot is listening on {HOST}:{PORT}')
# Open the log file for writing
with open('honeypot.log', 'a') as f:
while True:
# Wait for a connection
conn, addr = s.accept()
# Log the connection
current_time = datetime.datetime.now()
print(f'Received connection from {addr[0]}:{addr[1]} at {current_time}')
f.write(f'{current_time}, connection from {addr[0]}:{addr[1]}\n')
# Receive the data in small chunks and retransmit it
data = b''
try:
while True:
chunk = conn.recv(1024)
if not chunk:
break
data += chunk
# Check if the data size is too large
if len(data) > 1024*1024:
print(f'Data from {addr[0]}:{addr[1]} is too large, closing connection.')
f.write(f'{current_time}, data from {addr[0]}:{addr[1]} is too large, closing connection\n')
conn.close()
break
# Send the response with a custom banner
response = b'HTTP/1.1 200 OK\r\n'
response += b'Server: Apache/2.2.17 (Unix) mod_ssl/2.2.17 OpenSSL/0.9.8g\r\n'
response += b'Content-Type: text/html\r\n'
response += b'Content-Length: 20\r\n'
response += b'\r\n'
response += b'This is not the web page you are looking for.'
conn.sendall(response)
except (socket.error, ConnectionResetError):
print(f'Error receiving data from {addr[0]}:{addr[1]}, closing connection.')
f.write(f'{current_time}, error receiving data from {addr[0]}:{addr[1]}, closing connection\n')
except socket.timeout:
print(f'Timeout error occurred when sending response to {addr[0]}:{addr[1]}, closing connection.')
f.write(f'{current_time}, timeout error when sending response to {addr[0]}:{addr[1]}, closing connection\n')
finally:
# Log the request
f.write(f'{current_time}, request from {addr[0]}:{addr[1]}, data: {data.decode("utf-8", "ignore")}\n')
# Close the connection
conn.close()
Tests
First, I've tried a simple curl
. It worked.. partially. For some reason, curl
hanged. But I've ignored that issue and launched the nmap -sV -sC -p80 localhost -vvv
scan. Interestingly, it was completed in a little below 4 minutes. So that simple program can slow down attackers to 4 minutes on a single port 80! Well, that's quite amazing. Interestingly, because we are logging each request in the file - I can see what nmap
is doing behind the scenes.
The results of the scan are inconclusive.
Conclusion
Did it work? Yeah, I must say it works pretty well for the untargeted scans. It's far too little tests done, but I wanted mainly to check how it will handle such a specific task - and how would such honeypot perform under nmap
scan.
There is still a lot of work to do, to make it more robust, add some more services, improve logging.
Resources