When monitoring Kubernetes clusters using Elastic Stack, the volume of logs can be overwhelming, often reaching gigabytes per minute. This is particularly true for OpenShift clusters, where significant traffic originates from system namespaces you might not be familiar with.
Optimizing log collection becomes crucial for maintaining system efficiency and resource utilization. Success in this endeavor relies heavily on implementing the right collection strategy and understanding which logs are truly valuable for your monitoring needs
A systematic approach to log management starts with proper monitoring. While sophisticated tools aren’t necessary, a must have is a basic dashboard which tracks two essential metrics:
These visualizations help identify:
Remember that observability’s primary purpose is to provide clear insights into cluster operations. If you identify gaps in your monitoring coverage, don’t hesitate to implement additional visualizations that address them.
Once you’ve identified the namespaces and applications generating the highest volume of logs, you can begin the optimization process.
Start by evaluating the necessity of all logging outputs. Are all current logs essential? If yes, focus on retention policies. If no, consider reducing application verbosity.
An often overlooked aspect of log management is adjusting application verbosity levels. Default settings rarely align with specific organizational needs, making this adjustment crucial for optimal log management.
The process of modifying verbosity levels is typically straightforward. Consult the application’s documentation, locate the configuration file (usually in YAML format), and adjust the related setting with a simple line edit.
In certain scenarios, you may encounter limitations or specific requirements when managing log verbosity:
The Elastic Agent offers a solution through its processors configuration in the integration policy. The following configuration demonstrates how to exclude monitoring namespaces and specific pod logs:
- drop_event:
when:
or:
- equals:
kubernetes.namespace: openshift-monitoring
- equals:
kubernetes.namespace: monitoring
- regexp:
kubernetes.pod.name: "openshift-pipelines-operator-*"
This example of processor configuration will:
monitoring
namespacesopenshift-pipelines-operator-*
The processors feature offers extensive capabilities beyond this example. For comprehensive information about available processors and their configurations, consult the related docs.
To maintain the effective operation of your Elastic cluster’s log ingestion, it is important to carefully apply and adjust these optimization steps according to your specific requirements. As you tailor these strategies, it is advisable to err on the side of caution—when uncertain about certain log entries, retaining them is generally the best approach to prevent the loss of potentially valuable information.
If you find that these optimization steps have not sufficiently reduced your log volume, it may be beneficial to consider implementing custom log retention policies. This is a topic that will be explored further in a future post.
Interest in more readings about monitoring OpenShift? You might find this article useful!
Did you find this article interesting? Does it match your skill set? Programming is at the heart of how we develop customized solutions. In fact, we’re currently hiring for roles just like this and others here at Würth Phoenix.