Catchpoint Network Insights gives us confidence about what is happening with both our internal systems and on external servers, so we can proactively ensure availability and reachability.
With Datadog as a partner for internal monitoring and the decision not to build a solution in house and distract its lean team from the core business, Team Internet evaluated Catchpoint’s Network Observability solution. Lunz quickly discovered that this out-of-the-box solution would address a major visibility gap for Team Internet by combining Border Gateway Protocol (BGP) monitoring with Domain Name System (DNS), and Traceroute.
With Catchpoint, Team Internet can ensure that its widely distributed architectural components – ISPs, wireless networks, cloud providers, and BGP routes – all work together seamlessly to ensure its servers are available and that its services are available and reachable at all times.
Ensuring availability and reachability
Now the company has a nearly immediate and clear view into situations that it was once blind to, such as providers being unable to reach its platform. Specific incidents include a UK provider being unable to reach Team Internet because of an issue with a BGP announcement. More recently, traffic from Russia was routed unexpectedly.
“We can’t see these occurrences solely using ServerGuard24 because it operates in just a few locations. With Catchpoint, we can check from so many more locations around the world,” says Lunz.
“We were never able to monitor reachability in this detail before using the Catchpoint solution. Now we can figure out whether an issue is related to a BGP announcement, routing or something else instead of blindly trusting our partners that everything is working as it should,” he says.
Gaining monitoring flexibility
According to Lunz, Catchpoint stands out because of its ability to check intervals down at a granular level.
“We need the flexibility to check network conditions every 5 or 10 seconds so we can zero in on specific issues, and most solutions only check one time per minute.”
Appreciating support for SSL tests
Team Internet must synchronize all SSL to its data centers, even those where it houses its own servers. With SSL tests enabled via Network Insights, it can terminate SSL and mitigate CPU calculations on data center servers, even those hosted by AWS.
“This will allow us to monitor how long it takes to synchronize our certificates in all data centers so we can take measures if the synchronization will be too slow,” Lunz continues.
Streamlining on-call responses
Integration between Catchpoint and OpsGenie, Team Internet’s incident management tool, ensures that on-call staff is only woken up for critical issues. Plus, because Catchpoint’s BGP data is delivered in real time, Team Internet can be alerted to route anomalies as they happen so they can detect and remediate them quickly.
“The staff on on-call rotation are the same people developing our solutions so seeing what is happening beyond our network and across the Internet is very valuable,” says Lunz.
Optimizing operations and driving strategic growth
Deeper monitoring and better insights have empowered Team Internet to increase the availability of its services.
“We can’t fix issues in networks beyond our control but we can see them and tell our partners so they proactively plan accordingly,” Lunz explains.
In fact, Team Internet is now empowered with a level of foresight that enables it to operate at a new level.
“We can recognize and locate issues and the networks where they are occurring and keep our partners up to date so they can optimize their traffic purchases. We can even spot potential issues in advance, such as by seeing rising latency in a region,” he continues.
In the past, Team Internet’s partners and the sales team often notified Lunz’s team about availability issues.
“Now we are aware of the issues before anyone else. This will help us deliver a new level of service and prevent huge losses due to issues that happen outside of our network. Our IT Operations Lead exports graphs and BGP announcements from Catchpoint so our data center teams are empowered when reaching out to peering and upstream partners,” Lunz explains.
In fact, Lunz and his team are making use of the insights from Catchpoint to strategically plan for Team Internet’s expansion.
“With Catchpoint, we can detect latency issues and make informed decisions about setting up data centers in new regions. We are already using the insights to optimize our name servers and increase DNS responses. We’ll do the same with our edge servers to move even closer to users,” Lunz concludes.
Empowering the business intelligence team
Lunz anticipates that Team Internet will get additional value from Catchpoint Network Observability solution going forward. Currently, the company writes about 1 TByte of data every day but the Business Intelligence (BI) team wants to focus on areas that can positively impact the business. With Catchpoint's network observability solution, Lunz can provide the BI team with more detailed information so the team can pinpoint potential issues ahead of time.
“This will also empower them to make smart decisions about where we should deploy new data centers to expand revenue,” he explains.