💡

This write-up is a part of the HTB Sherlocks series. Sherlocks are investigative challenges that test defensive security skills. I encourage you to try them out if you like digital forensics, incident response, post-breach analysis and malware analysis. Are you ready to start the investigation?

Incident Details

Name: Litter
Category: SOC
Difficulty: Easy (Solved)

Khalid has just logged onto a host that he and his team use as a testing host for many different purposes, it’s off their corporate network but has access to lots of resources in network. The host is used as a dumping ground for a lot of people at the company but it’s very useful, so no one has raised any issues. Little does Khalid know; the machine has been compromised and company information that should not have been on there has now been stolen – it’s up to you to figure out what has happened and what data has been taken.

Evidences

All evidence files are marked as readonly right after acquiring and their hash is written down. Read-only attribute does not affect the hash of a file.

01: ZIP archive, password protected (hacktheblue)

$ litter.zip
89ffb7db178388aad03ff1434279e28426be2421d8df6fc9c3afbee94de23b0d

02: PCAP file

$ suspicious_traffic.pcap
830c2c76e8055c6d807f642c684f716e3be73ddc33403d83c85f486086da47ca

Analysis

I've glanced through the PCAP file while having in mind that we have data exfiltration scenario. The first two things that come to my mind are DNS exfiltration and simply uploading data via TCP.

These shouldn't look like that!

At a glance, what protocol appears to be suspect in this attack?

DNS!

There seems to be a lot of traffic between our host and another, what is the IP address of the suspect host?

Wireshark > Statistics > Endpoints. Select IPv4, sort from the highest Packets.

What is the first command the attacker sends to the client?

Technically, it is possible to read what was transmitted using Wireshark display filter

ip.src == 192.168.157.144 and ip.dst == 192.168.157.145 and dns

But there is a Python tool that helps to extract that information from PCAP file.

$ python /opt/dnscat/dnscat_decoder.py suspicious_traffic.pcap "microsofto365.com" > dnscat_dump.txt

All following questions can be answered purely from the dump.

What is the version of the DNS tunneling tool the attacker is using?
The attackers attempt to rename the tool they accidentally left on the client’s host. What do they name it to?
The attacker attempts to enumerate the user’s cloud storage. How many files do they locate in their cloud storage directory?
What is the full location of the PII file that was stolen?
Exactly how many customer PII records were stolen?

Dump shows 720 records, but starts from 0 s, so the answer is 721.

Data Recovery

None required.

Lessons Learned

Because this challenge was a bit short on analysis part, let's stay a bit on DNS exfiltration.

DNS Data Exfiltration

MITRE ATT.CK Technique: T1048

Data exfiltration is all about flying under the radar. You can steal data simply by copying it to some removable drive (which is not that uncommon). You can upload to cloud service or network share - but these leaves very verbose fingerprint on TCP/UPD communication, network logs etc. The challenge is here that usual protocols can and are easily set up to be monitored against data leakage. Attackers started to get creative and thought - maybe we try to use something that generates a lot of traffic so will not raise an alert, and something that is commonly and widely used, so can't be blocked and won't look suspicious.

DNS protocol meets those standards (that come to my mind, definitely there are more of them). And there were times when no one monitored DNS traffic - because what for? It is more like a technical thing, "how the Internet works" thing - domain names act as mnemonics for specific IP addresses, so something has to resolve them. We can't drop it, we can't stop DNS resolution at all. And each system, each software, can generate a lot of DNS queries on daily routine. Windows Update, loading CSS or JS on pages, virus signatures updates, requesting data from services - all of these happen via named mnemonics, not IP addresses.

So how attackers figured out the way to use it as a form of connection between C&C and a victim? This is to be honest quite simple and easy to explain.

Assume that malware is already present on the victim's server.
Encode data (C&C command response, file, anything) to hex-encoded blocks, as if requesting DNS server to resolve the domain - conforming one of two formats (really oversimplified)
<encoded data>.<domain> or <tag>.<encoded data>
Pass the prepared message as a DNS request to the attacker's server.

That's why in this challenge, we could eyeball the suspiciously looking DNS request - no-one is asking to resolve the following domain name: 7768656e6d6f7265636861696e7361776d616e736572696573.microsofto.com

HTB Sherlock: Litter

Comments

HTB Sherlocks

HTB Sherlock: Bumblebee

More from this blog

Driver's Shadow: Unmasking the Kernel Intruder

HTB Sherlock: Noted

HTB Sherlock: MisCloud

Forensics: Tales for the Brave

👆 Access Patron Benefits 👆

Incident Details

Evidences

Analysis

Data Recovery

Lessons Learned

DNS Data Exfiltration

Additional readings

Command Palette

Comments

HTB Sherlocks

HTB Sherlock: Bumblebee

More from this blog

👆 Access Patron Benefits 👆

Incident Details

Evidences

Analysis

Data Recovery

Lessons Learned

DNS Data Exfiltration

Additional readings