For golang performance diagnosis, it is enough to read this article-Golang-php.cn

Table of Contents

2.1 profiling" >2.1 profiling

" >#2.1. 1 CPU Profiling

2.2 tracing" >2.2 tracing

2.3 GC" >2.3 GC

Home

Backend Development

Golang

For golang performance diagnosis, it is enough to read this article

Go语言进阶学习

Jul 24, 2023 pm 04:18 PM

golang Performance diagnostics

We are exposed to performance diagnosis problems on a daily basis, which are generally divided into two situations. One is that online applications really have performance problems; the other is that we need to estimate the performance of the system that is ready to go online; the latter requires stress testing. Auxiliary progress is not shown here.

For GO applications, performance diagnostic tools are mainly divided into two layers: OS level and GO application level (go tool pprof /trace /gc)

OS Diagnosis

## System diagnosis, we generally focus on three aspects: CPU, Memory, I/O

##1.1 CPU

CPU diagnosis mainly focuses on load average, CPU usage, and context switch. Commonly used

top commands are used to check the cpu usage and server load.

For golang performance diagnosis, it is enough to read this article Average load: 0.14 0.07 0.06 represents the average machine load in the past 1 minute, 5 minutes, and 15 minutes respectively. According to experience, if the load value is less than 0.7*CPU The number is normal. If it exceeds or reaches four or five times the number of CPU cores, the system load will be obviously high.

CPU context switching can be viewed

through the

vmstat command. The scenarios in which context switching occurs are as follows: Type:

The time slice is used up, the CPU normally schedules the next task
is Other tasks with higher priority are preempted
When the execution task encounters I/O blocking, the current task is suspended and switched to the next task
The user code actively suspends the current task to give up the CPU
Multi-tasking seizes resources because it is not available And was suspended
Hardware Interrupt

##1.2 Memory

From the perspective of the operating system, focus on whether the memory is sufficient for the application process. You can use the free -m command to check the memory usage.

Using the top command, you can view the virtual memory VIRT and physical memory RES used by the process. According to the formula VIRT = SWAP RES, you can calculate the swap partition (Swap) used by the specific application. Using too much swap partition will affect application performance. , you can adjust the swappiness value as small as possible.

##1.3 I/O

I/O includes disk I/O O and network I/O, disks are generally more prone to I/O bottlenecks. You can check the read and write status of the disk through

iostat, and you can see whether the disk I/O is normal through the CPU's I/O wait. If the disk I/O is always in a high state, it means that the disk is too slow or faulty and has become a performance bottleneck. Application optimization or disk replacement is required.

In addition to the commonly used commands such as top, ps, vmstat, iostat, etc., there are other Linux tools that can diagnose system problems, such as mpstat, tcpdump, netstat, pidstat, sar and other Linux performance diagnostic tools as shown below:

For golang performance diagnosis, it is enough to read this article

GO Application Diagnostics

#The go ecosystem has provided us with a large number of APIs and diagnostic tools to help us solve performance problems in go applications. The ones we commonly use can be roughly divided into two categories:

##Profiling collects specific events during program execution and collects sampling statistics to facilitate precise location of problems

Tracing A method of instrumenting code to analyze latency throughout the life cycle of a call or user request, and can span multiple Go processes.

2.1 profiling

profile is generally called performance Analysis, for a program, is various overview information when the program is running, including CPU usage, memory status, thread status, etc. Convenient to analyze expensive or frequently called program scenarios.

how to use?

1. First profiling code is buried in

import _ "net/http/pprof"


func main() {
    go func() {
        log.Println(http.ListenAndServe("0.0.0.0:9090", nil))
    }()
    ...
}

Copy after login

2. Save the profile at a specific point in time, such as saving heap information

curl http://localhost:6060/debug/pprof/heap --output heap.tar.gz

Copy after login

3. Use go tool pprof to analyze the saved profile snapshot, such as analyzing the above heap information

go tool pprof heap.tar.gz

Copy after login

#2.1. 1 CPU Profiling

pprof can help us analyze the problem of slow function execution

CThe problem of excessive PU usage

go tool pprof http://localhost:6060/debug/pprof/profile?second=10

Copy after login

Command line mode: Common command top list traces

top: View the top 10 function information sorted by memory or CPU usage

flat: The CPU time occupied by the current function (
excluding other functions it calls)
flat% : The percentage of CPU used by the current function to the total CPU time
sum%: The sum of the flat percentages of each previous line
cum: The cumulative amount, How long the current function and its sub-functions occupy the CPU
cum%: The cumulative amount accounts for the percentage of the total amount

cum>=flat

list: 查看某个函数的代码以及该函数每行代码的指标信息

traces:打印所有函数调用栈以及调用栈的指标信息

For golang performance diagnosis, it is enough to read this article

UI界面方式：从服务器download下生成的sample文件

go tool pprof -http=:8080 pprof.xxx.samples.cpu.001.pb.gz

Copy after login

For golang performance diagnosis, it is enough to read this article

Flame graph很清晰得可以看到当前CPU被哪些函数执行栈占用

1.2 Heap Profiling

go tool pprof http://localhost:6060/debug/pprof/heap?second=10

Copy after login

命令行 UI查看方式同理

For golang performance diagnosis, it is enough to read this article

graph中方框越大占用内存越多火焰图宽度越大占用内存越多

SAMPLE->inuse_objects可以查看当前的对象数量这个参数对于分析gc线程占用较高cpu时很有用处它侧重查看对象数量

inuse_space图可以查看具体的内存占用

毕竟对于10个100m的对象和1亿个10字节的对象占用内存几乎一样大，但是回收起来一亿个小对象肯定比10个大对象要慢很多。

go tool pprof -inuse_space http://localhost:6060/debug/pprof/heap : 分析应用程序的常驻内存占用情况 （默认）
go tool pprof -alloc_objects http://localhost:6060/debug/pprof/heap: 分析应用程序的内存临时分配情况

Copy after login

1.3 并发请求问题查看方式跟上面类似。

go tool pprof http://localhost:6060/debug/pprof/goroutine
go tool pprof http://localhost:6060/debug/pprof/block
go tool pprof http://localhost:6060/debug/pprof/mutex

Copy after login

2.2 tracing

trace并不是万能的，它更侧重于记录分析采样时间内运行时系统具体干了什么。

收集trace数据的三种方式:

1. 使用runtime/trace包调用trace.Start()和trace.Stop()

2. 使用go test -trace=测试标识

3. 使用debug/pprof/trace handler 获取运行时系统最好的方法

例如，通过

go tool pprof http://localhost:6060/debug/pprof/trace?seconds=20 > trace.out

Copy after login

获取运行时服务的trace信息，使用

go tool trace trace.out

Copy after login

会自动打开浏览器展示出UI界面

For golang performance diagnosis, it is enough to read this article

其中trace view 只能使用chrome浏览器查看，这里go截止1.14版本存在一个 bug，解决办法如下：

go tool trace trace.out 无法查看trace view
go bug:https://github.com/golang/go/issues/25151
mac 解决版本：安装gotip
go get golang.org/dl/gotip
gotip download
then  使用 gotip tool trace trace.out即可

Copy after login

获取的trace.out 二进制文件也可以转化为pprof格式的文件

go tool trace -pprof=TYPE trace.out > TYPE.pprof
Tips:生成的profile文件 支持 network profiling、synchronization profiling、syscall profiling、scheduler profiling
go tool pprof TYPE.pprof

Copy after login

使用gotip tool trace trace.out可以查看到trace view的丰富操作界面：

操作技巧：

ctrl + 1 选择信息

ctrl + 2 移动选区

ctrl + 3 放大选区

ctrl + 4 指定选区区间

shift + ? 帮助信息

AWSD跟游戏快捷键类似玩起来跟顺手

整体的控制台信息如下图：

For golang performance diagnosis, it is enough to read this article

时间线: 显示执行的时间单元根据时间的纬度不同可以调整区间
堆: 显示执行期间内存的分配和释放情况
协程(Goroutine): 显示每个时间点哪些Goroutine在运行哪些goroutine等待调度，其包含 GC 等待（GCWaiting）、可运行（Runnable）、运行中（Running）这三种状态。

goroutine区域选中时间区间

For golang performance diagnosis, it is enough to read this article

OS线程(Machine): 显示在执行期间有多少个线程在运行，其包含正在调用 Syscall（InSyscall）、运行中（Running）这两种状态。

For golang performance diagnosis, it is enough to read this article

虚拟处理器Processor: 每个虚拟处理器显示一行，虚拟处理器的数量一般默认为系统内核数。数量由环境变量GOMAXPROCS控制
协程和事件: 显示在每个虚拟处理器上有什么 Goroutine 正在运行，而连线行为代表事件关联。

每个Processor分两层，上一层表示Processor上运行的goroutine的信息，下一层表示processor附加的事件比如SysCall 或runtime system events

For golang performance diagnosis, it is enough to read this article

ctrl+3 放大选区，选中goroutine 可以查看，特定时间点特定goroutine的执行堆栈信息以及关联的事件信息

For golang performance diagnosis, it is enough to read this article

goroutine analysis

For golang performance diagnosis, it is enough to read this article

点击goroutine的id 可以跳到trace view 详细查看goroutine具体干了什么

名称	含义
Execution	执行时间
Network wait	网络等待时间
Sync Block	同步阻塞时间
Blocking syscall	系统调用阻塞时间
Scheduler wait	调度等待时间
GC Sweeping	GC清扫时间
GC Pause	GC暂停时间

实践一个延迟问题诊断

当我们一个执行关键任务的协程从运行中被阻塞。这里可能的原因：被syscall阻塞、阻塞在共享内存(channel/mutex etc)、阻塞在运行时(如 GC)、甚至有可能是运行时调度器不工作导致的。这种问题使用pprof很难排查，

使用trace只要我们确定了时间范围就可以在proc区域很容易找到问题的源头

For golang performance diagnosis, it is enough to read this article

上图可见，GC 的MARK阶段阻塞了主协程的运行

2.3 GC

golang的gc算法是根据标记清除改进的三色标记法，大概流程:

初始所有对象都是白色

Stack scan阶段:从root出发扫描所有可达对象，标记为灰色并放入待处理队列；root包括全局指针和goroutine栈上的指针
Mark阶段:1.从待处理队列取出灰色对象，将其引用的对象标记为灰色并放入队列，自身标记为黑色 2. re-scan全局指针和栈，因为mark和用户程序并行运行，故过程1的时候可能会有新的对象分配，这时需要通过写屏障(write barrier)记录下来；re-scan再完成检查；
重复步骤Mark阶段，直到灰色对象队列为空，执行清扫工作（白色即为垃圾对象）

go的三色标记法也存在STW(Stop The World),大致有两个场景

GC即将开始时，需要STW 做一些准备工作，如enable write barrier
re-scan也需要STW，否则上面Mark阶段的re-scan无法终止

通过GODEBUG=gctrace=1可以开启gc日志，查看gc的结果信息

$ GODEBUG=gctrace=1 go run main.go   
gc 1 @0.001s 19%: 0.014+3.7+0.015 ms clock, 0.11+2.8/5.7/3.2+0.12 ms cpu, 5->6->6 MB, 6 MB goal, 8 P
gc 2 @0.024s 6%: 0.004+3.4+0.010 ms clock, 0.032+1.4/4.5/5.3+0.085 ms cpu, 13->14->13 MB, 14 MB goal, 8 P
gc 3 @0.093s 3%: 0.004+6.1+0.027 ms clock, 0.032+0.19/11/15+0.22 ms cpu, 24->25->22 MB, 26 MB goal, 8 P
scvg: 0 MB released
scvg: inuse: 4, idle: 58, sys: 63, released: 58, consumed: 4 (MB)
scvg: 0 MB released
scvg: inuse: 4, idle: 58, sys: 63, released: 58, consumed: 4 (MB)
scvg: 0 MB released
scvg: inuse: 4, idle: 58, sys: 63, released: 58, consumed: 4 (MB)
scvg: 0 MB released
scvg: inuse: 4, idle: 58, sys: 63, released: 58, consumed: 4 (MB)

Copy after login

格式

gc # @#s #%: #+#+# ms clock, #+#/#/#+# ms cpu, #->#-># MB, # MB goal, # P

含义

gc#：GC 执行次数的编号，每次叠加。

@#s：自程序启动后到当前的具体秒数。

#%：自程序启动以来在GC中花费的时间百分比。

#+...+#：GC 的标记工作共使用的 CPU 时间占总 CPU 时间的百分比。

#->#-># MB：分别表示 GC 启动时, GC 结束时, GC 活动时的堆大小.

#MB goal：下一次触发 GC 的内存占用阈值。

#P：当前使用的处理器 P 的数量。

拓展

当我们的程序陷入CPU 和IO混和负载过高时，我们使用pprof profile只能检测出CPU耗时的函数，但是屏蔽了IO等待过长的函数。

https://github.com/felixge/fgprof 给出了一个解决方案：

具体做法是:用一个后台协程在采样时间区间内每秒99次调用runtime.GoruntineProfile，返回的结果忽略了协程当时消耗CPU还是非消耗CPU的区别进行统计，保存在内存中的map中，可导出转化为pprof

具体用法：

package main
 
import(
    _ "net/http/pprof"
    "github.com/felixge/fgprof"
)
 
func main() {
    http.DefaultServeMux.Handle("/debug/fgprof", fgprof.Handler())
    go func() {
        log.Println(http.ListenAndServe(":6060", nil))
    }()
 
    // <code to profile>
}
 
git clone https://github.com/brendangregg/FlameGraph
cd FlameGraph
curl -s &#39;localhost:6060/debug/fgprof?seconds=3&#39; > fgprof.fold
./flamegraph.pl fgprof.fold > fgprof.svg

Copy after login

如果遇到这种CPU消耗型和非CPU消耗型混合的情况下可以试试排查下。

The above is the detailed content of For golang performance diagnosis, it is enough to read this article. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks ago By DDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks ago By DDD

Where to find the Crane Control Keycard in Atomfall

3 weeks ago By DDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

4 weeks ago By DDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7577

CakePHP Tutorial

1386

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

111

Related knowledge

How to safely read and write files using Golang? Jun 06, 2024 pm 05:14 PM

Reading and writing files safely in Go is crucial. Guidelines include: Checking file permissions Closing files using defer Validating file paths Using context timeouts Following these guidelines ensures the security of your data and the robustness of your application.

How to configure connection pool for Golang database connection? Jun 06, 2024 am 11:21 AM

How to configure connection pooling for Go database connections? Use the DB type in the database/sql package to create a database connection; set MaxOpenConns to control the maximum number of concurrent connections; set MaxIdleConns to set the maximum number of idle connections; set ConnMaxLifetime to control the maximum life cycle of the connection.

How to save JSON data to database in Golang? Jun 06, 2024 am 11:24 AM

JSON data can be saved into a MySQL database by using the gjson library or the json.Unmarshal function. The gjson library provides convenience methods to parse JSON fields, and the json.Unmarshal function requires a target type pointer to unmarshal JSON data. Both methods require preparing SQL statements and performing insert operations to persist the data into the database.

Golang framework vs. Go framework: Comparison of internal architecture and external features Jun 06, 2024 pm 12:37 PM

The difference between the GoLang framework and the Go framework is reflected in the internal architecture and external features. The GoLang framework is based on the Go standard library and extends its functionality, while the Go framework consists of independent libraries to achieve specific purposes. The GoLang framework is more flexible and the Go framework is easier to use. The GoLang framework has a slight advantage in performance, and the Go framework is more scalable. Case: gin-gonic (Go framework) is used to build REST API, while Echo (GoLang framework) is used to build web applications.

How to find the first substring matched by a Golang regular expression? Jun 06, 2024 am 10:51 AM

The FindStringSubmatch function finds the first substring matched by a regular expression: the function returns a slice containing the matching substring, with the first element being the entire matched string and subsequent elements being individual substrings. Code example: regexp.FindStringSubmatch(text,pattern) returns a slice of matching substrings. Practical case: It can be used to match the domain name in the email address, for example: email:="user@example.com", pattern:=@([^\s]+)$ to get the domain name match[1].

Transforming from front-end to back-end development, is it more promising to learn Java or Golang? Apr 02, 2025 am 09:12 AM

Backend learning path: The exploration journey from front-end to back-end As a back-end beginner who transforms from front-end development, you already have the foundation of nodejs,...

How to use predefined time zone with Golang? Jun 06, 2024 pm 01:02 PM

Using predefined time zones in Go includes the following steps: Import the "time" package. Load a specific time zone through the LoadLocation function. Use the loaded time zone in operations such as creating Time objects, parsing time strings, and performing date and time conversions. Compare dates using different time zones to illustrate the application of the predefined time zone feature.

Golang framework development practical tutorial: FAQs Jun 06, 2024 am 11:02 AM

Go framework development FAQ: Framework selection: Depends on application requirements and developer preferences, such as Gin (API), Echo (extensible), Beego (ORM), Iris (performance). Installation and use: Use the gomod command to install, import the framework and use it. Database interaction: Use ORM libraries, such as gorm, to establish database connections and operations. Authentication and authorization: Use session management and authentication middleware such as gin-contrib/sessions. Practical case: Use the Gin framework to build a simple blog API that provides POST, GET and other functions.

See all articles