- Ansible introduction and practical operation demonstration
- 1. Overview: Ansible is a new automated operation and maintenance tool. It is written in Python and combines the strengths of many earlier tools (puppet, cfengine, chef, func, fabric) to provide batch system configuration, batch program deployment, batch command execution, and similar functions. Features of Ansible: Simple deployment: you only need to install Ansible on the control node, and the managed nodes need no additional setup. By default, devices are managed over the SSH protocol. Centralized master-slave management. Simple configuration, powerful functionality, and strong extensibility. Supports an API and custom modules, which can be easily extended in Python. Playbooks provide powerful, customizable configuration and state management, including for cloud computing platforms and big data.
- Safety 1004 2023-06-09 19:35:44
-
- Docker log size limit and switching storage directory
- Many CentOS virtual machines are used across the product's environments. By default the root directory has 50G of space, and over time that space runs out. The approach so far has been to clear unused images and Docker logs so that the servers stay usable. Only when cleaning stopped helping did I start looking for other methods. Until we are forced into a corner, we habitually rely on the methods and tools we already know. They may not be efficient, but they solve the problem. This way of thinking is not unusable, but it is harmful: it keeps people in their comfort zone, unwilling to leave it. The other methods mentioned above are actually very simple and can be found by checking the documentation. They are to limit the size of Docker logs and to reduce the size of the logs that already exist.
- Safety 1388 2023-06-09 19:18:42
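As a rough sketch of the two ideas in the title above (capping log size and moving the storage directory), the snippet below builds a Docker daemon configuration of the kind usually placed in `/etc/docker/daemon.json`. The keys `log-driver`, `log-opts` (`max-size`, `max-file`) and `data-root` are standard daemon options; the specific values and the `/data/docker` path are assumptions chosen for illustration and are not taken from the article.

```python
import json

# Illustrative values only: the size, file count and path are assumptions.
daemon_config = {
    "log-driver": "json-file",      # Docker's default logging driver
    "log-opts": {
        "max-size": "100m",         # rotate a container log once it reaches 100 MB
        "max-file": "3",            # keep at most 3 rotated files per container
    },
    "data-root": "/data/docker",    # hypothetical directory on a larger disk
}

# Render the JSON text that would be written to the daemon configuration file.
daemon_json = json.dumps(daemon_config, indent=2)
print(daemon_json)
```

The daemon has to be restarted to pick up changes like these, and the log options only apply to containers created afterwards; existing containers keep their old logging settings until they are recreated.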
-
- To end this topic: Is it true that operation and maintenance jobs can no longer be done?
- Last Friday, Ma Chi and Lai Wei had an online discussion. The topic was: Are operation and maintenance positions really no longer available? As the host, I was both the igniter and the facilitator :) I benefited a lot from hearing the two veterans share their views. I am writing it down today so that I do not forget it; consider this a review of the live broadcast. On tool platforms: tool platforms will replace part of the labor force. This is obvious and needs no further explanation. But who will build the tool platforms? That is worth a closer look. Monitoring systems, CI/CD platforms, chaos engineering platforms, middleware services, etc. are all platforms, built by Platform Engineers, referred to as PE. PE is obviously divided into many groups, and each PE group is responsible for a limited number of platforms. These scattered
- Safety 1354 2023-06-09 18:57:47
-
- How to set up a high-availability Apache (HTTP) cluster on RHEL 9/8
- Pacemaker is high-availability cluster software for Linux-like operating systems. Pacemaker is known as a "cluster resource manager": it maximizes the availability of cluster resources by failing them over between cluster nodes. Pacemaker uses Corosync for heartbeat and internal communication between cluster components. Corosync is also responsible for voting (quorum) in the cluster. Prerequisites: before we begin, make sure you have the following: two RHEL 9/8 servers; a Red Hat subscription or a locally configured repository; SSH access to both servers; root or sudo permissions; an Internet connection. Lab details: Server 1: node1.exa
- Safety 1472 2023-06-09 18:20:27
-
- HAS 2023 | Intelligent Operations and Maintenance Roundtable, discussing the new generation of intelligent operation and maintenance of the '30' vision
- [Shenzhen, China, April 27, 2023] At the recently held Huawei Analyst Conference, Mr. Ye Rongchun, Director of the Intelligent Operation and Maintenance Business Department of Huawei's Global Technical Services Department, met with operators, vertical industry partners and top global analyst organizations to share and exchange views on the vision, key elements and technologies of the new generation of intelligent operation and maintenance. [HAS2023 Huawei Intelligent Operation and Maintenance Roundtable] Mr. Ye Rongchun shared his understanding of the digital transformation of operation and maintenance, the generational evolution of operation and maintenance and its elements, and introduced Huawei AUTIN intelligent operation and maintenance solutions and practices. 1. The generational evolution of operation and maintenance is driven by both business and technology. On the technology side, Huawei believes that beyond general-purpose technologies (such as AI), breakthroughs in domain technologies and even in theory are also needed.
- Safety 1105 2023-06-09 17:52:28
-
- Uber Practice: Some experiences in operating and maintaining large-scale distributed systems
- This article is by Uber engineer Gergely Orosz. The original is at: https://blog.pragmaticengineer.com/operating-a-high-scale-distributed-system/ In the past few years, I have been building and operating a large-scale distributed system: Uber's payment system. During this period, I learned a lot about distributed architecture concepts and witnessed first-hand the challenges of running high-load, high-availability systems (a system is far from finished when it is developed, and the challenges of running it online are actually even greater). Building the system itself is an interesting endeavor. How planning systems handle 10x/100
- Safety 708 2023-06-09 16:53:49
-
- Jing Yuan: Operation and Maintenance Geometry
- Editor's note: Boss Jing was the head of my team when I joined Baidu in 2011. He is a hard-core veteran. This opportunity was not easy to come by, so I asked him all the common questions in the industry for the benefit of readers. Boss Jing has a free and easy nature; his jokes and curses are all written down, and his principles are easy to understand. This is the first issue of the down-to-earth and high-level "Operation and Maintenance Forum", starting now! Guest introduction: Jing Yuan, first from left, former Baidu operation and maintenance architect, former Xiaomi operation and maintenance leader, former Meicai CIO. Some operation and maintenance personnel report that their company knows very little about the value of operation and maintenance. How did you explain the value of operation and maintenance clearly to the company back then? First of all, you need to clearly explain to the company the job responsibilities of operation and maintenance (what it does and what it produces) and its key indicators (which measure output). For example, the work will focus on stability, safety, efficiency, etc., and start
- Safety 1304 2023-06-09 16:50:53
-
- Three surprising things you can do with systemd
- It's not just about making your computer boot up faster. When systemd first came out, there was a lot of news about how it could speed up boot times. This feature appeals to most people (less so to those who don't reboot), so in many ways, that's the reputation it still has today. While systemd does play a role in starting services in parallel during the startup process, it does much more than that. Here are three things you might not realize systemd can do, but you should take advantage of. 1. Simplify Linux ps: If you have ever used ps, or even just the top command, then you know that your computer has hundreds of processes running at all times. sometimes it's you
- Safety 1234 2023-06-09 15:38:15
-
- Beginner's Guide: Kubernetes Observability Basics
- In today's complex software development environment, ensuring that applications run smoothly is critical. Observability is a key aspect in infrastructure management, helping development and operations teams gain insights into the performance and health of systems, effectively detect and resolve issues, and ultimately deliver a better user experience. Kubernetes is an open source container orchestration engine used to automate the deployment, scaling and management of containerized applications. As Kubernetes grows in popularity, it becomes critical to understand how to monitor and observe these clusters. In this article, we will introduce the concept of observability and its three main pillars: metrics, logs, and traces. We’ll explore the observability features built into K8s and introduce some popular Kubernetes enhancements
- Safety 1301 2023-06-09 14:22:42
-
- K8s enhanced workload OpenKruise operation and maintenance enhancements
- We have already learned the basic concepts of OpenKruise and several commonly used enhanced controllers. Next, we will continue with other advanced functions. SidecarSet: SidecarSet uses an admission webhook to automatically inject sidecar containers into qualifying Pods created in the cluster. In addition to injecting them when the Pod is created, SidecarSet can also upgrade, in place, the image of a sidecar container that has already been injected into a Pod. SidecarSet decouples the definition and life cycle of sidecar containers from business containers. It is mainly used to manage stateless sidecar containers, such as monitoring, logging, etc.
- Safety 818 2023-06-09 14:13:06
-
- Tuyou Zou Yi: How to operate and maintain small and medium-sized companies?
- Through interviews and submissions, veterans in the field of operation and maintenance are invited to share their insights and exchange ideas, with a view to forming some advanced consensus and helping the industry move forward. In this issue, we invite Zou Yi, the operation and maintenance director of Tuyou Games. Mr. Zou often jokingly calls himself the operation and maintenance representative of the world's top 5 million companies. Clearly, he feels that the way small and medium-sized companies build out operation and maintenance differs from that of large enterprises. Today we have a few questions and ask Mr. Zou to share his journey of integrating research and operations at small and medium-sized companies. This is the 6th issue of the down-to-earth and high-level "Operation and Maintenance Forum", starting now! Question preview: Tuyou is a game company. What do you think are the unique features of game operation and maintenance? What are the biggest operational challenges you face? How did you solve these challenges? Game operation and maintenance people
- Safety 1025 2023-06-09 13:56:08
-
- A must-read for operation and maintenance monitoring: A must-know InfluxDB usage guide that can be used at critical moments
- What is InfluxDB? InfluxDB is an open source time series database developed by InfluxData. It is written in Go and focuses on high-performance querying and storage of time series data. InfluxDB is widely used for storing system monitoring data and real-time data in the IoT industry. Technical features include: InfluxDB takes full advantage of the characteristics of the Go language in its implementation and can be deployed standalone without any external dependencies. InfluxDB provides a SQL-like query language and a set of built-in functions to make querying data easier. The data stored in InfluxDB is logically composed of a measurement, a tag set, a field set and a
- Safety 1716 2023-06-09 13:40:49
-
- From a CTO perspective: How to build operation and maintenance/SRE capabilities
- Recently, there have been many articles discussing whether operation and maintenance positions should stay or go. The SRETalk public account I host has also published the opinions of many operation and maintenance directors. I have also talked privately with many people in the industry and formed a few ideas of my own, which I record here for reference by CTOs and CIOs. If you work in operation and maintenance/SRE and feel confused, I also recommend that you read this article carefully. I think this is an in-depth piece of thinking; it may be boring, but it will be helpful for career choices and team building. This article welcomes well-founded discussion, but does not welcome arrogance. In addition, many things are not black and white. It would be great if the content could inspire you and bring new thinking to CXOs' decision-making. In addition, SRETalk's interviews with operation and maintenance directors will continue, and there will be more
- Safety 879 2023-06-09 12:37:08
-
- Use Terraform to manage OpenStack clusters
- After running an OpenStack production environment and a home lab for a while, I can safely say that being able to provision workloads and manage them from both an administrator and a tenant perspective is important. Terraform is an open source Infrastructure as Code (IaC) software tool for provisioning networks, servers, cloud platforms, etc. Terraform uses a declarative language that serves as a blueprint for the infrastructure you are building. You can manage it with Git, which makes for a powerful GitOps use case. This article introduces the basics of using Terraform to manage OpenStack clusters. I recreated Open using Terraform
- Safety 1222 2023-06-09 12:20:32
-
- Detailed explanation of MBR and GPT partitions, have you learned it?
- 1. MBR partition: MBR is the abbreviation of Master Boot Record. It is an old partition table format used to identify and manage partitions on a hard disk. The MBR partition table can identify up to 4 primary partitions, or 3 primary partitions and 1 extended partition. The 2TB limit means that a hard disk using the MBR partition table format can only be partitioned up to a maximum of 2TB. Because MBR uses 32-bit LBA (Logical Block Addressing), the maximum address can only represent 2^32 sectors, and each sector is 512 bytes in size, so the maximum supported hard drive size is 2TB. In order to support larger hard disk capacities, hard disk manufacturers and operating system developers introduced GPT (GUID Partit
- Safety 11319 2023-06-09 11:02:28
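The 2TB figure in the MBR excerpt above follows directly from the two numbers it quotes. A minimal worked calculation, assuming the 512-byte sector size stated in the text (the variable names are illustrative only):

```python
# Values taken from the excerpt: 32-bit LBA addresses and 512-byte sectors.
LBA_BITS = 32
SECTOR_SIZE_BYTES = 512

# A 32-bit address can refer to 2^32 distinct sectors.
max_sectors = 2 ** LBA_BITS

# Total addressable capacity in bytes, then expressed in binary terabytes (TiB).
max_bytes = max_sectors * SECTOR_SIZE_BYTES
max_tib = max_bytes / 2 ** 40

print(max_sectors, max_bytes, max_tib)
```

With these inputs the product is 2^32 × 2^9 = 2^41 bytes, which is 2 TiB. A disk with larger sectors would raise the limit proportionally, since the limit is on the sector count, not the byte count.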