Service Discovery in microservices

07 October 2019

Unless you have been living in a cave, you have heard the buzzword microservices. That does not mean you need to build microservices for every small piece of functionality; a lot of problems can be solved easily with a monolith.

Whether you need microservices or not can be answered with a few questions:

Do you have independent business domains that each function as a separate entity within the application?
Do you have a large team of developers who are split into small teams based on business domain?
Do you want to roll out new functionality for different modules quickly and independently of other modules?

This is not a complete list of questions, but it can serve as a basis for deciding whether you should go for microservices or not.

So let's now assume we have decided to go with the microservices approach and have designed our microservices to be loosely coupled. Now there are two problems we may want to solve:

Cross-service communication: a situation where one service needs to access some functionality of another service. In the monolithic approach, this would be just a function call. But with microservices, the services are not in the same code base, so let's see what options we have.
How can the external world access microservice functionality? Each service could be called by an external application or expose a REST API for a web application. How can requests be mapped to these microservices?

Case 1: All microservices are deployed on every machine

Microservices in Single Instance

So, here we have deployed all the services together on the same machine. All the services are hosted on different ports and they have exposed REST endpoints.

Whenever the gateway receives a request for a service, it simply forwards the request to that service's port on the instance, which solves the problem of external access. Services call each other's REST endpoints instead of calling functions directly, which solves the problem of cross-service communication.
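To make the Case 1 setup concrete, here is a minimal sketch of the kind of hard-coded routing table the gateway (and every calling service) would need. The service names, port numbers and the `resolve` helper are assumptions made for illustration only; they are not taken from any particular gateway product.

```python
# Hypothetical port table: in Case 1 every service runs on the same host,
# so a service is identified only by its port number.
SERVICE_PORTS = {
    "service1": 8081,
    "service2": 8082,
    "service3": 8083,
}

HOST = "localhost"  # assumption: a single machine hosts all services


def resolve(service_name: str, path: str) -> str:
    """Build the URL of a service endpoint from the hard-coded port table."""
    try:
        port = SERVICE_PORTS[service_name]
    except KeyError:
        raise LookupError(f"unknown service: {service_name}") from None
    return f"http://{HOST}:{port}/{path.lstrip('/')}"
```

Calling `resolve("service3", "/orders")` builds the address of Service3's endpoint. Every new service means editing `SERVICE_PORTS` in the gateway and in each caller.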

Although this approach solves the problem of cross-service communication, it has some disadvantages too:

A bigger machine is required, as all the services are deployed on a single machine. Different services can have different requirements: some services may only need more processing power, others only more memory. Considering all of these constraints, we may have to choose a higher-configuration machine that can accommodate all of these services. As the number of services grows, we would have to increase the size of the machine too.
We need to monitor the health of all services and have a mechanism to handle the failure of a single service, either by restarting the service or by allocating new machines dynamically.
Some services don't need to be up and running all the time.
The gateway needs the port number of each service hard-coded and has to be updated for each new service. Each service also has to hard-code the port number of any service it wants to call.
The above solution works for a single instance, but in the real world there will be multiple instances. To handle this, we need a load balancer in front of those instances so that requests from the gateway are distributed evenly between them.

Case 2: Deploy Services independently & use Service Discovery

microservice in service discovery

Let's understand what is happening here. The services are hosted on different machines, and it's completely up to us how we deploy them: they can be grouped together or deployed separately on any of the instances. Each service registers itself with the Service Registry.

NOTE: Different deployment orchestration technologies are available. Those are out of scope for this article.

The Service Registry is a key-value database. For each service instance it stores a unique ID, the service name, the machine's IP address, the port number and the health monitoring API. The Service Registry invokes the health monitoring API at a fixed interval to check whether the service is healthy.
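The registry described above can be sketched as a small in-memory store. This is only an illustration under stated assumptions: the class and method names (`ServiceRegistry`, `register`, `run_health_checks`, `lookup`) and the `/health` path are hypothetical, and the health probe is passed in as a function so that the sketch runs without a network.

```python
import uuid
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Instance:
    instance_id: str   # unique id of this registration
    service_name: str
    ip: str
    port: int
    health_url: str    # health monitoring API of the instance
    healthy: bool = True


class ServiceRegistry:
    def __init__(self, probe: Callable[[str], bool]):
        # probe(health_url) returns True when the instance answers as healthy.
        self._probe = probe
        self._instances: Dict[str, Instance] = {}

    def register(self, service_name: str, ip: str, port: int,
                 health_path: str = "/health") -> str:
        """Called by a service on startup; returns its unique id."""
        instance_id = str(uuid.uuid4())
        health_url = f"http://{ip}:{port}{health_path}"
        self._instances[instance_id] = Instance(
            instance_id, service_name, ip, port, health_url)
        return instance_id

    def run_health_checks(self) -> None:
        """One round of checks; a real registry would repeat this on a timer."""
        for instance in self._instances.values():
            instance.healthy = self._probe(instance.health_url)

    def lookup(self, service_name: str) -> List[Instance]:
        """Return only the healthy instances of the given service."""
        return [i for i in self._instances.values()
                if i.service_name == service_name and i.healthy]
```

Injecting the probe keeps the registry logic separate from how health is actually measured; in practice it would be an HTTP call to the health endpoint.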

Now let's see how the above problems are solved:

Cross-service communication: Consider the first scenario, where Service1 wants to access Service3. To do so, Service1 queries the service registry for Service3 and fetches the list of instances where Service3 is available. Using the IP address and port number, Service1 can now access Service3. This is also called the Client-Side Service Discovery pattern.
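The client-side lookup could look roughly like the sketch below. The `REGISTRY` dictionary stands in for the registry's answer, and `query_registry` and `discover` are hypothetical helper names; a real client would query the registry over the network instead.

```python
import random
from typing import Dict, List, Tuple

Address = Tuple[str, int]  # (ip, port)

# Stand-in for the Service Registry contents (made-up addresses).
REGISTRY: Dict[str, List[Address]] = {
    "service3": [("10.0.0.5", 8080), ("10.0.0.6", 8080)],
}


def query_registry(service_name: str) -> List[Address]:
    """Fetch the list of instances where the service is available."""
    return list(REGISTRY.get(service_name, []))


def discover(service_name: str) -> str:
    """Client-side discovery: the caller itself picks an instance."""
    instances = query_registry(service_name)
    if not instances:
        raise LookupError(f"no instance registered for {service_name}")
    ip, port = random.choice(instances)  # simplest possible selection
    return f"http://{ip}:{port}"
```

Service1 would call `discover("service3")` before each request and then send the request to the returned base URL.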

How can the external world access microservice functionality? Consider the second scenario, where the gateway wants to redirect an external request to Service1. The gateway queries the Service Registry and fetches the list of instances where Service1 is available. Using the IP address and port number, the gateway redirects the request to any of the machines where Service1 is available. The approach where external clients access microservices through a component that performs the discovery for them is called the Server-Side Service Discovery pattern.
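A rough sketch of the gateway side follows, assuming the gateway maps a URL path prefix to a service name and receives a registry lookup function. The `Gateway` class, its `route` method and the route table are illustrative assumptions, not the API of any real gateway.

```python
from typing import Callable, Dict, List, Tuple

Address = Tuple[str, int]  # (ip, port)


class Gateway:
    def __init__(self, routes: Dict[str, str],
                 lookup: Callable[[str], List[Address]]):
        self._routes = routes   # path prefix -> service name
        self._lookup = lookup   # registry query, injected for the sketch

    def route(self, path: str) -> str:
        """Return the internal URL an external request should be sent to."""
        for prefix, service_name in self._routes.items():
            if path == prefix or path.startswith(prefix + "/"):
                instances = self._lookup(service_name)
                if not instances:
                    raise LookupError(f"no instance for {service_name}")
                ip, port = instances[0]  # simplification: take the first one
                remainder = path[len(prefix):] or "/"
                return f"http://{ip}:{port}{remainder}"
        raise LookupError(f"no route for {path}")
```

The external client only ever sees the gateway's address; the registry lookup happens behind it on every request.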

Benefits of Case 2

No need to hard-code the IP address or port number of services, as they can be found dynamically at request time.
Deployment is independent of the microservice communication architecture.
Based on health status, requests can be sent only to healthy services.
No need for an internal load balancer with a static IP in front of the services, as service addresses can be found dynamically. However, a load-balancing strategy then has to be implemented at the gateway; otherwise load is not distributed evenly between the instances/pods of a single service.
Strategies to distribute the load evenly between available machines can be implemented on the client side and at the gateway.
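One simple strategy for distributing load evenly is round robin. The sketch below assumes the caller (a client or the gateway) already has the list of healthy instances from the registry; `RoundRobinBalancer` and `pick` are hypothetical names chosen for this example.

```python
from typing import Dict, List, Tuple

Address = Tuple[str, int]  # (ip, port)


class RoundRobinBalancer:
    def __init__(self) -> None:
        # Next position to use, tracked separately for each service.
        self._next: Dict[str, int] = {}

    def pick(self, service_name: str, instances: List[Address]) -> Address:
        """Cycle through the instances of a service in order."""
        if not instances:
            raise LookupError(f"no instance available for {service_name}")
        # The modulo keeps the index valid if the instance list shrinks.
        index = self._next.get(service_name, 0) % len(instances)
        self._next[service_name] = index + 1
        return instances[index]
```

With three instances, four consecutive calls to `pick` return the first, second, third and then the first instance again. Other strategies, such as random or least-connections selection, would fit behind the same `pick` interface.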

BONUS

As we have seen, the gateway plays a crucial role in both approaches: it routes requests from external clients to the respective microservices and serves as the back end for any external client. This approach is called the API Gateway pattern.