Resource Automation for Cloud Computing – an Overview


The Resource Automation Series is finished so far – but it will continue once there is new exciting content. There is still a lot to be covered regarding resource automation. With this post, I want to give an Overview of what was discussed. In the first post, I discussed if and how Cloud Computing will change the datacenter design. We were talking about standardisation in the datacenter and how Cloud Computing affects this. The next post was about how automations in datacenters. Cloud Computing needs us to think different since we don’t talk about 10 or 100 servers but rather tens of thousands. This also leads to various problems, which are discussed in this blog post. Resource Automation also requires a cooperation between Developer Teams and Operation Teams. This is often referenced as “DevOps“. How deployment can be done, requires different strategies. Resource Automation leads to several problems that need to be addressed. One approach is to apply Datacenter Event

read more Resource Automation for Cloud Computing – an Overview

Resource Automation in the Cloud: Datacenter Automation and Integration


This is the last post on our series about resource automation in the Cloud. Today, we will look at datacenter automation and integration. What are the Benefits for Data Center Automation? First of all, it frees up IT staff. If more things are automated, you don’t need to allocate resources for that. Your IT can care about more important things. You should automate things that are repeatable, such as provisioning machines. With Data Center Integration, you leverage best capabilities of Existing systems Processes Environments Key areas for Datacenter Automation are: Reducing labor costs by allowing reduction or reallocation of people Improving service levels through faster measurement and reaction times Improving efficiency by freeing up skilled resources to do smarter work Improving performance and availability by reducing human errors and delays Improving productivity by allowing companies to do more work with the same resources Improving agility by allowing rapid reaction to change, delivering new processes and applications faster Reducing reliance on high-value

read more Resource Automation in the Cloud: Datacenter Automation and Integration

Resource Automation in the Cloud: Monitoring


This is a follow-up post on the series about resource automation in the Cloud. In this part, we will look at monitoring. Monitoring is not the easiest thing to do in distributed systems. You have to monitor a large number of instances. The challenge is to find out what you want to monitor. If you run your application (such as a SaaS-Platform) you might not be interested in the performance of a single instance but in the performance of the application itself. You might not be interested in the I/O performance of an instance but again of the overall experience your application delivers. To find that metrics, you have to invest significant experience into monitoring. Let us look at how monitoring works basically. There are 2 key concepts to monitor instances: Agent-less Monitoring Agent-based Monitoring If we talk about Agent-less monitoring, we have two possibilities: Remotly analyse the System with a remote API (e.g. Log Data on File System) Analyse Network

read more Resource Automation in the Cloud: Monitoring

Resource Automation in the Cloud: Self-service IT


This is a follow-up Post to the Series on Resource Automation in the Cloud. This time we will talk about self-service IT. Self-service IT is an important factor for Automation. It basically enables users to solve their problems and not to talk about the technology. For instance, if the marketing department needs to run a website for a campaign, the IT should enable the department to start this “out of the box”: an empty website template (e. g. WordPress-based) should be started. Furthermore, scaling should be enabled, since the load will change over time. The website should also be branded in the corporate design. The target of self-service IT is that the IT department provides Tools and Services that gives the users more independence. Another sample for self-service IT is the launch of virtual instances. This is a rather easy thing to accomplish as this can be handled by self-service platforms such as OpenStack, Eucalyptus or various vendor platforms. To

read more Resource Automation in the Cloud: Self-service IT

Resource Automation in the Cloud: Event Lifecycle Management


This is the follow-up post to our Series “Resource Automation”. In this part we will focus on Event Lifecycle Management and the associated challenges we face in terms of resource Automation. By Event Lifecycle Managment, we basically mean what happens if events in the datacenter or cloud occur. Events can be different things but most likely they are of the type “errors” or “warnings”. If an error occurs, this is triggered as an event and necessary steps will be taken. In each of the Steps of the Lifecylce, erros can and will occur. So we have to take that into account. Responding to errors is critical for the business. Just imagine what happens if you fail to deliver a service level. More calls will be received by your support Desk and you not only have a technical problem but also an organisational. In many cases, the support desk is outsourced and it can be sort of scaled on demand – however,

read more Resource Automation in the Cloud: Event Lifecycle Management

Resource Automation in the Cloud: Problems


The next few posts will deal with resource automation in the Cloud. As with other tasks, this is not a thing you solve in one day nor will it be easy. In the first post on resource automation, we will look at the main issues related to resource automation before we dig deeper into resource automation itself. The major areas of resource automation problems are: Service Level Monitoring Event Lifecycle Management IT Self Service Service Level Monitoring SLAs are key metrics in Cloud-based environments (but not only!). A key issue is that Response Times are seldom in SLAs. Metrics that should be contained in SLAs are:  Application availability, Storage availability, CPU use, Network I/O, Events and many more. A key issue is that there are several sources that SLAs have to be built from: Management Tools Monitors Logs Services There are several reasons why building SLAs manually is not that effective: Higher staff levels and traing costs Higher service desk costs Increased risk of errors

read more Resource Automation in the Cloud: Problems