Log Format Catalog
This catalog covers 18 common log formats and provides ready-to-use log_format patterns for the DetectMate parsers, such as the MatcherParser.
Key concept: log_format splits a raw log line into structured header fields that become entries in ParserSchema.logFormatVariables. The special <Content> token captures the variable message body, which is then matched against your template file to produce EventID and variables. Formats without <Content> are fully structured — no template file is needed.
Generic configuration:
parsers:
MyParser:
method_type: matcher_parser
auto_config: false
log_format: "<Date> <Time> <Level> <Component>: <Content>"
params:
path_templates: path/to/templates.txt # only required when log_format contains <Content>
Quick Reference
| Format | Category | Log source | Has <Content> |
|---|---|---|---|
| audit | Security | Linux auditd | yes |
| ApacheAccess | Web & Network | Apache httpd access log | no |
| OpenSSH | Security | OpenSSH sshd | yes |
| Apache | Web & Network | Apache httpd error log | yes |
| Proxifier | Web & Network | Proxifier proxy client | yes |
| HDFS | Distributed Systems | Hadoop HDFS daemon | yes |
| Hadoop | Distributed Systems | Hadoop MapReduce | yes |
| Spark | Distributed Systems | Apache Spark | yes |
| Zookeeper | Distributed Systems | Apache Zookeeper | yes |
| OpenStack | Distributed Systems | OpenStack Nova/etc. | yes |
| BGL | HPC / Supercomputers | IBM BlueGene/L | yes |
| HPC | HPC / Supercomputers | Generic HPC cluster | yes |
| Thunderbird | HPC / Supercomputers | Thunderbird supercomputer | yes |
| Linux | Operating Systems | Linux syslog / auth | yes |
| Windows | Operating Systems | Windows CBS/Setup | yes |
| Mac | Operating Systems | macOS system log | yes |
| Android | Operating Systems | Android logcat | yes |
| HealthApp | Application | Mobile health app | yes |
Security
audit
Linux auditd log. Produced by the kernel audit subsystem; records system calls, authentication, and policy events.
type=USER_ACCT msg=audit(1642723741.072:375): pid=10125 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting acct="root" exe="/usr/sbin/cron" hostname=? addr=? terminal=cron res=success'
log_format: type=<Type> msg=audit(<Time>): <Content>
Variable noise patterns: Numeric field values (=\d+), IP addresses ((\d+\.){3}\d+).
OpenSSH
OpenSSH sshd authentication log. Records login attempts, key exchanges, and session events.
Dec 10 07:07:38 LabSZ sshd[24206]: Failed password for invalid user test9 from 192.168.1.1 port 20992 ssh2
log_format: <Date> <Day> <Time> <Component> sshd[<Pid>]: <Content>
Variable noise patterns: IP addresses ((\d+\.){3}\d+), hostnames (([\w-]+\.){2,}[\w-]+).
Web & Network
ApacheAccess
Apache httpd access log (Combined Log Format). Each line is fully structured — no template file is required.
64.242.88.10 - - [07/Mar/2004:16:10:02 -0800] "GET /twiki/bin/edit/Main/Double_bounce_sender?topicparent=Main.ConfigurationVariables HTTP/1.1" 401 12846 "-" "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2) Gecko/20021202"
log_format: <IP> - - [<Time>] "<RequestType> <Request> <Version>" <Status> <ResponseSize> "<Referer>" "<UserAgent>"
All fields land directly in
logFormatVariables. Useheader_variablesin your detector config to track e.g.Status,IP, orRequestType.
Apache
Apache httpd error log. Records server errors, warnings, and informational notices.
[Sun Dec 04 04:47:44 2005] [notice] workerEnv.init() ok /etc/httpd/conf/workers2.properties
log_format: [<Time>] [<Level>] <Content>
Variable noise patterns: IP addresses ((\d+\.){3}\d+).
Proxifier
Proxifier proxy client log. Records connection open/close events with traffic statistics.
[10:39:55] Skype.exe - www.bing.com:443 open through proxy 1.2.3.4:8080 OK
log_format: [<Time>] <Program> - <Content>
Variable noise patterns: Duration values (<\d+\ssec), domain:port pairs (([\w-]+\.)+[\w-]+(:\d+)?), transfer sizes ([KGTM]B).
Distributed Systems
HDFS
Hadoop Distributed File System daemon log. Emitted by DataNodes, NameNodes, and other HDFS services.
081109 203518 148 INFO dfs.DataNode$DataXceiver: Receiving block blk_-1608999687919862906 src: /10.251.73.220:54106 dest: /10.251.73.220:50010
log_format: <Date> <Time> <Pid> <Level> <Component>: <Content>
Variable noise patterns: HDFS block IDs (blk_-?\d+), IP addresses with optional port ((\d+\.){3}\d+(:\d+)?).
Hadoop
Hadoop MapReduce / YARN log. Produced by the ResourceManager, NodeManager, and application containers.
2015-10-17 15:37:15,811 INFO [main] org.apache.hadoop.mapreduce.v2.app.MRAppMaster: Created MRAppMaster for application appattempt_1443144491176_0001_000001
log_format: <Date> <Time> <Level> [<Process>] <Component>: <Content>
Variable noise patterns: IP addresses ((\d+\.){3}\d+).
Spark
Apache Spark log. Emitted by driver and executor JVMs.
15/10/16 03:35:11 INFO storage.BlockManagerMasterActor: Registering block manager 192.168.1.1:61698 with 896.4 MB RAM, BlockManagerId(1, 192.168.1.1, 61698)
log_format: <Date> <Time> <Level> <Component>: <Content>
Variable noise patterns: IP addresses, memory size units (\b[KGTM]?B\b), domain-like names (([\w-]+\.){2,}[\w-]+).
Zookeeper
Apache Zookeeper log. Produced by quorum peers and server threads.
2015-07-29 17:37:13,090 - INFO [QuorumPeer:NIOServerCnxn@943] - Closed socket connection for client /192.168.1.8:56613 which had sessionid 0x14eb8e0aac70006
log_format: <Date> <Time> - <Level> [<Node>:<Component>@<Id>] - <Content>
Variable noise patterns: IP addresses with optional port ((/|)(\d+\.){3}\d+(:\d+)?).
OpenStack
OpenStack service log (Nova, Neutron, etc.). Structured with a request context field.
nova-manage.log 2016-09-28 03:51:11.899 25746 INFO nova.metadata.handler [req-5a1f3f8a-fd7a-4dc5-bb32-f642b2de2e74 - - - - -] 127.0.0.1 "GET /openstack/2012-08-10/meta-data/local-ipv4" status: 200 len: 14 time: 0.007720
log_format: <Logrecord> <Date> <Time> <Pid> <Level> <Component> [<ADDR>] <Content>
Variable noise patterns: IP addresses (((\d+\.){3}\d+,?)+), file paths (/.+?\s), numeric values (\d+).
HPC / Supercomputers
BGL
IBM BlueGene/L supercomputer log. Contains hardware-level diagnostic messages with node location identifiers.
- 1117838570 2005.06.03 R02-M1-N0-C:J12-U11 2005-06-03-15.42.50.363779 R02-M1-N0-C:J12-U11 RAS KERNEL INFO instruction cache parity error corrected
log_format: <Label> <Timestamp> <Date> <Node> <Time> <NodeRepeat> <Type> <Component> <Level> <Content>
Variable noise patterns: Core dump references (core\.\d+).
HPC
Generic HPC cluster log. Produced by batch schedulers and system daemons on compute nodes.
5 hpc-lnx-node11 afs RUNNING 1169880007 OK File transfer: 23 files copied 0 bytes in 0.000 secs
log_format: <LogId> <Node> <Component> <State> <Time> <Flag> <Content>
Variable noise patterns: Numeric assignments (=\d+).
Thunderbird
Thunderbird supercomputer log. Similar to BGL but from a different cluster; includes user and location fields.
- 1131566401 2005.11.09 admin Nov 9 21:00:01 tbird1 crond[3049]: (root) CMD (/usr/lib/sa/sa1 1 1)
log_format: <Label> <Timestamp> <Date> <User> <Month> <Day> <Time> <Location> <Component>([<PID>])?: <Content>
Variable noise patterns: IP addresses ((\d+\.){3}\d+).
Operating Systems
Linux
Linux syslog format (RFC 3164). Produced by syslog, rsyslog, and syslog-ng — covers auth, cron, kernel messages, and more.
Jun 14 15:16:02 combo sshd(pam_unix)[19939]: authentication failure; logname= uid=0 euid=0 tty=NODEVssh ruser= rhost=218.188.2.4
log_format: <Month> <Date> <Time> <Level> <Component>([<PID>])?: <Content>
Note:
<Level>captures the hostname in standard syslog (the field name follows benchmark convention).
Variable noise patterns: IP addresses ((\d+\.){3}\d+), embedded timestamps (\d{2}:\d{2}:\d{2}).
Windows
Windows Component-Based Servicing (CBS) / Setup log. Fixed-width columns with variable spacing.
2016-09-28 04:30:22, Info CBS Loaded Servicing Stack v6.1.7601.23505 with Core: C:\Windows\winsxs\amd64_microsoft-windows-servicingstack_31bf3856ad364e35_6.1.7601.23505_none_681aa442f6fed7f0\cbscore.dll
log_format: <Date> <Time>, <Level> <Component> <Content>
Note: The multiple spaces between fields are literal and must be preserved in the pattern.
Variable noise patterns: Hexadecimal values (0x.*?\s).
Mac
macOS unified system log (pre-Unified Logging). Produced by system daemons and user-space processes.
Jul 9 00:17:03 calvisitor-10-105-160-95 com.apple.backupd[12396] (Error): VSDBUtil: Failed to change_attributes for volume "/Volumes/SanDisk USB Drive"
log_format: <Month> <Date> <Time> <User> <Component>[<PID>]( (<Address>))?: <Content>
Note: Double space between
<Month>and<Date>occurs when the day is single-digit (standard macOS padding).
Variable noise patterns: Domain names (([\w-]+\.){2,}[\w-]+).
Android
Android logcat output. Produced by the Android logging system across all framework and app components.
01-02 12:23:42.768 1632 2044 E ActivityManager: mDVFSHelper is null
log_format: <Date> <Time> <Pid> <Tid> <Level> <Component>: <Content>
Variable noise patterns: File system paths ((/[\w-]+)+), domain names, hex/numeric values (\b(\-?\+?\d+)\b|\b0[Xx][a-fA-F\d]+\b|\b[a-fA-F\d]{4,}\b).
Application
HealthApp
Mobile health application log. Pipe-delimited format from a step-counter app.
20170201-01:15:35:631|Step_StandCounter|4243|step counts since boot=76149; goal=6000; active=0; distance=45
log_format: <Time>|<Component>|<Pid>|<Content>
Go back to Index