Cloud monitoring in the DevOps world

Coupling the cloud with a DevOps culture can often be key to optimising productivity and meeting your technological goals.

With the cloud being an effective approach to ensuring scalability and adaptability in today’s working world, coupling it with a DevOps culture can often be key to optimising productivity and meeting your technological goals.

You can think of DevOps as a culture built around related practices and toolsets. A key benefit of DevOps is that it shortens the development lifecycle because the development and operations teams are more aligned and collaborate from the start of a project. The DevOps model relies on effective tooling to help teams rapidly and reliably deploy and innovate for their clients. These tools automate manual tasks, help teams manage complex environments at scale, and keep engineers in control of the high deployment velocity that is enabled by DevOps.

But, more often that not, businesses are using the cloud as part of their overall technical landscape and to meet their strategic goals.

One of the places that DevOps and the cloud intersect is cloud monitoring. As the system grows bigger and release cycles shorten, it’s important to ensure that engineering teams have access to (near) real-time information. This ensures that subsequent changes are based on the true state of an environment.

What is cloud monitoring?

Cloud monitoring uses manual and automated tools to monitor, analyse and report on the availability and performance of websites, servers, applications and other cloud infrastructure. For example, cloud monitoring tools enable you to test an application for speed, functionality, and reliability to help ensure that it is performing optimally.

Cloud monitoring is generally performed as part of an overall cloud management strategy, enabling IT administrators to review the operational status of cloud-based resources. It also provides a holistic view of cloud metrics, customer interaction with the system (real user monitoring), log data and more.

Best practices

Based on almost 40 years’ experience as an international software development company, and many successful cloud implementations under our belt, adopting these best practices help us achieve optimal cloud uptime and performance:

Identify key performance indicators (KPIs) and other metrics that affect your business’s bottom line and the overall user experience. When it comes to cloud environments, there’s a lot to monitor, but not everything warrants close attention. Designating which KPIs and metrics you want to track prior to implementing a cloud monitoring strategy will give you a clear sense of what to prioritise.
Group underlying components into their applications and map them to relevant business services. Given that cloud environments are highly complex, it’s critical to understand the relationships between individual resources and build that information into the monitoring system. This allows for a more comprehensive understanding of how an issue within one component might affect the broader application and, more importantly, business and end-users.
Keep a close eye on cloud service usage and fees. The beauty of cloud computing is that it’s highly scalable. But increased usage can result in higher costs, so make sure your cloud monitoring solution is tracking usage activity and associated costs. In an ideally architected and configured environment, usage costs should not increase in lockstep with user activity i.e. there are economies of scale that can be benefitted on by using the cloud.
Establish good baselines. Different applications have different base activity levels. It’s important that you know what constitutes as normal for each so that your cloud monitoring solution automatically scales your computing infrastructure to maintain peak performance levels if an app exceeds its baseline or to keep costs down if it falls below.
Consolidate all data within a single, centralised platform. It’s important that all your cloud monitoring data – including data pulled from multiple different sources – live in one place so it’s easily accessible (for further processing) and consistent, and so that you have a holistic view of cloud performance.

AWS tools to accurately monitor cloud environments

In the case of environments built on AWS infrastructure, these are some of the tools we use for successful delivery:

CloudWatch is a monitoring and management service that provides data and actionable insights for AWS, hybrid, and on-premises applications and infrastructure resources
Grafana is an open-source solution for running data analytics, and ingesting metrics that make sense of the massive amounts of data our systems generate. It facilitates the monitoring of our apps by summarising useful information with the help of cool, customisable dashboards
Prometheus is an open-source and community driven performance monitoring solution. It also supports container monitoring and creation of rules which trigger alerts based on time series data
AppDynamics facilitates real-time insights into application performance. This DevOps tool monitors and reports on the performance of all transactions flowing through your application
DataDog is an observability service for cloud-scale applications, providing monitoring of servers, databases, tools, and services, through a SaaS-based data analytics platform
Splunk is a software platform widely used for monitoring, searching, analysing and visualising machine-generated data in real time. It performs capturing, indexing, and correlation of the real time data in a searchable container and produces graphs, alerts, dashboards and visualisations
Loki is a horizontally scalable, highly available, multi-tenant log aggregation system inspired by Prometheus
AWS Xray is a service that helps engineers analyse and debug distributed applications by enabling them to follow requests as they flow through the system. X-Ray is used to monitor application traces, including the performance of calls to other downstream components or services, in either cloud-hosted applications or from their own machines during development
OpenTracing (Jaeger) is open-source software for tracing transactions between distributed services. It’s used for monitoring and troubleshooting complex microservices environments

Adopting a DevOps culture alongside your cloud environment can be key to helping businesses effectively deliver resilient services and applications at a rapid pace. If you’re looking for a software development partner with extensive experience implementing both the DevOps culture and cloud deployments, reach out to us.

South Africa as a data-safe safe choice for EU businesses

Digital strategy, Software development, Tech & business consulting

Cookie	Duration	Description
__cf_bm	1 hour	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
_GRECAPTCHA	5 months 27 days	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
PHPSESSID	session	This cookie is native to PHP applications. The cookie stores and identifies a user's unique session ID to manage user sessions on the website. The cookie is a session cookie and will be deleted when all the browser windows are closed.
rc::a	never	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
rc::b	session	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
rc::c	session	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
rc::f	never	This cookie is set by the Google recaptcha service to identify bots to protect the website against malicious spam attacks.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zalb_*	session	Zoho sets this cookie for load balancing and session stickiness. It ensures that user requests are consistently directed to the same server during a session, helping maintain session integrity and improving website performance.

Cookie	Duration	Description
_zcsr_tmp	session	Zoho sets this cookie for the login function on the website.
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
yt-remote-cast-available	session	The yt-remote-cast-available cookie is used to store the user's preferences regarding whether casting is available on their YouTube video player.
yt-remote-cast-installed	session	The yt-remote-cast-installed cookie is used to store the user's video player preferences using embedded YouTube video.
yt-remote-fast-check-period	session	The yt-remote-fast-check-period cookie is used by YouTube to store the user's video player preferences for embedded YouTube videos.
yt-remote-session-app	session	The yt-remote-session-app cookie is used by YouTube to store user preferences and information about the interface of the embedded YouTube video player.
yt-remote-session-name	session	The yt-remote-session-name cookie is used by YouTube to store the user's video player preferences using embedded YouTube video.
ytidb::LAST_RESULT_ENTRY_KEY	never	The cookie ytidb::LAST_RESULT_ENTRY_KEY is used by YouTube to store the last search result entry that was clicked by the user. This information is used to improve the user experience by providing more relevant search results in the future.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
_ga_0NYLJN3XCS	2 years	This cookie is installed by Google Analytics.
_ga_GY82V894KJ	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_*	1 minute	Google Analytics sets this cookie to store a unique user ID.
_gat_gtag_UA_28875611_4	1 minute	Set by Google to distinguish users.
_gcl_au	3 months	Google Tag Manager sets the cookie to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
_tt_enable_cookie	3 months	Tiktok set this cookie to collect data about behaviour and activities on the website and to measure the effectiveness of the advertising.
_ttp	3 months	TikTok set this cookie to track and improve the performance of advertising campaigns, as well as to personalise the user experience.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies store information about how the user uses the website to present them with relevant ads according to the user profile.
li_sugr	3 months	LinkedIn sets this cookie to collect user behaviour data to optimise the website and make advertisements on the website more relevant.
muc_ads	1 year 1 month 4 days	Twitter sets this cookie to collect user behaviour and interaction data to optimize the website.
personalization_id	1 year 1 month 4 days	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	6 months	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
VISITOR_PRIVACY_METADATA	6 months	YouTube sets this cookie to store the user's cookie consent state for the current domain.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__Secure-ROLLOUT_TOKEN	6 months	Description is currently not available.
AnalyticsSyncHistory	1 month	No description
crmcsr	session	No description available.
gclid	1 month	Description is currently not available.
li_gc	5 months 27 days	No description
ln_or	1 day	No description
ttcsid	3 months	Description is currently not available.
ttcsid_CP28EQBC77U5LTIRJOS0	3 months	Description is currently not available.

Cloud monitoring in the DevOps world

Coupling the cloud with a DevOps culture can often be key to optimising productivity and meeting your technological goals.

What is cloud monitoring?

Best practices

AWS tools to accurately monitor cloud environments

Related articles

South Africa as a data-safe safe choice for EU businesses

Transforming ESG: Nearshore teams delivering excellence

BBD launches new BBD Cloud Solutions offering

What’s next? We’re ready!

Cloud monitoring in the DevOps world

Coupling the cloud with a DevOps culture can often be key to optimising productivity and meeting your technological goals.

What is cloud monitoring?

Best practices

AWS tools to accurately monitor cloud environments

Related articles

South Africa as a data-safe safe choice for EU businesses

Transforming ESG: Nearshore teams delivering excellence

BBD launches new BBD Cloud Solutions offering

What’s next? We’re ready!

Subscribe for updates