Go函數閉包底層實現-Golang-PHP中文網

函數閉包對大多數讀者而言並不是什麼高階詞彙，那什麼是閉包呢？這裡摘自Wiki上的定義：

a closure is a record storing a function together with an environment.The environment is a mapping associating each free variable of the function (variables that are used locally, but defined in an enclosing scope) with the value or reference to which the name was bound when the closure was created.

####1而言之，閉包是一個由函數和引用環境而組合的實體。閉包在實作過程中，往往是透過呼叫外部函數傳回其內部函數來實現的。在這其中，引用環境是指的外部函數中自由變數（內部函數使用，但定義於外部函數中）的映射。內部函數透過引入外部的自由變量，使得這些變數即使離開了外部函數的環境也不會被釋放或刪除，在返回的內部函數仍然持有這些資訊。 ###########################這段話可能不太理解，我們直接用看例子。 ######

 1package main
 2
 3import "fmt"
 4
 5func outer() func() int {
 6    x := 1
 7    return func() int {
 8        x++
 9        return x
10    }
11}
12
13func main() {
14    closure := outer()
15    fmt.Println(closure())
16    fmt.Println(closure())
17}
18
19// output
202
213

登入後複製

######可以看到，Go中的兩個特性（函數是一等公民，支援匿名函數）使其很容易實現閉包。 ######

在上面的範例中，closure#是閉包函數，變數x就是引用環境，它們的組合就是閉包實體。 x本來是outer函數之內，匿名函數以外的局部變數。在正常函數呼叫結束之後，x就會隨著函數堆疊的銷毀而被銷毀。但由於匿名函數的引用，outer傳回的函數物件會一直持有x#變數。這造成了每次呼叫閉包closure，x##變數都會得到累加。

這裡和普通的函數呼叫不一樣：局部變數x並沒有隨著函數的呼叫結束而消失。那麼，這是為什麼呢？

实现原理

我们不妨从汇编入手，将上述代码稍微修改一下

 1package main
 2
 3func outer() func() int {
 4    x := 1
 5    return func() int {
 6        x++
 7        return x
 8    }
 9}
10
11func main() {
12    _ := outer()
13}

登入後複製

得到编译后的汇编语句如下。

 1$ go tool compile -S -N -l main.go 
 2"".outer STEXT size=181 args=0x8 locals=0x28
 3        0x0000 00000 (main.go:3)        TEXT    "".outer(SB), ABIInternal, $40-8
 4        ...
 5        0x0021 00033 (main.go:3)        MOVQ    $0, "".~r0+48(SP)
 6        0x002a 00042 (main.go:4)        LEAQ    type.int(SB), AX
 7        0x0031 00049 (main.go:4)        MOVQ    AX, (SP)
 8        0x0035 00053 (main.go:4)        PCDATA  $1, $0
 9        0x0035 00053 (main.go:4)        CALL    runtime.newobject(SB)
10        0x003a 00058 (main.go:4)        MOVQ    8(SP), AX
11        0x003f 00063 (main.go:4)        MOVQ    AX, "".&x+24(SP)
12        0x0044 00068 (main.go:4)        MOVQ    $1, (AX)
13        0x004b 00075 (main.go:5)        LEAQ    type.noalg.struct { F uintptr; "".x *int }(SB), AX
14        0x0052 00082 (main.go:5)        MOVQ    AX, (SP)
15        0x0056 00086 (main.go:5)        PCDATA  $1, $1
16        0x0056 00086 (main.go:5)        CALL    runtime.newobject(SB)
17        0x005b 00091 (main.go:5)        MOVQ    8(SP), AX
18        0x0060 00096 (main.go:5)        MOVQ    AX, ""..autotmp_4+16(SP)
19        0x0065 00101 (main.go:5)        LEAQ    "".outer.func1(SB), CX
20        0x006c 00108 (main.go:5)        MOVQ    CX, (AX)
21        ...

登入後複製

首先，我们发现不一样的是 x:=1 会调用 runtime.newobject 函数（内置new函数的底层函数，它返回数据类型指针）。在正常函数局部变量的定义时，例如

 1package main
 2
 3func add() int {
 4    x := 100
 5    x++
 6    return x
 7}
 8
 9func main() {
10    _ = add()
11}

登入後複製

我们能发现 x:=100 是不会调用 runtime.newobject 函数的，它对应的汇编是如下

1"".add STEXT nosplit size=58 args=0x8 locals=0x10
2        0x0000 00000 (main.go:3)        TEXT    "".add(SB), NOSPLIT|ABIInternal, $16-8
3        ...
4        0x000e 00014 (main.go:3)        MOVQ    $0, "".~r0+24(SP)
5        0x0017 00023 (main.go:4)        MOVQ    $100, "".x(SP)  // x:=100
6        0x001f 00031 (main.go:5)        MOVQ    $101, "".x(SP)
7        0x0027 00039 (main.go:6)        MOVQ    $101, "".~r0+24(SP)
8        ...

登入後複製

留着疑问，继续往下看。我们发现有以下语句

1        0x004b 00075 (main.go:5)        LEAQ    type.noalg.struct { F uintptr; "".x *int }(SB), AX
2        0x0052 00082 (main.go:5)        MOVQ    AX, (SP)
3        0x0056 00086 (main.go:5)        PCDATA  $1, $1
4        0x0056 00086 (main.go:5)        CALL    runtime.newobject(SB)

登入後複製

我们看到 type.noalg.struct { F uintptr; "".x *int }(SB)，这其实就是定义的一个闭包数据类型，它的结构表示如下

1type closure struct {
2    F uintptr   // 函数指针，代表着内部匿名函数
3    x *int      // 自由变量x，代表着对外部环境的引用
4}

登入後複製

之后，在通过 runtime.newobject 函数创建了闭包对象。而且由于 LEAQ xxx yyy代表的是将 xxx 指针，传递给 yyy，因此 outer 函数最终的返回，其实是闭包结构体对象指针。在《详解逃逸分析》一文中，我们详细地描述了Go编译器的逃逸分析机制，对于这种函数返回暴露给外部指针的情况，很明显，闭包对象会被分配至堆上，变量x也会随着对象逃逸至堆。这就很好地解释了为什么x变量没有随着函数栈的销毁而消亡。

我们可以通过逃逸分析来验证我们的结论

 1$  go build -gcflags &#39;-m -m -l&#39; main.go
 2# command-line-arguments
 3./main.go:6:3: outer.func1 capturing by ref: x (addr=true assign=true width=8)
 4./main.go:5:9: func literal escapes to heap:
 5./main.go:5:9:   flow: ~r0 = &{storage for func literal}:
 6./main.go:5:9:     from func literal (spill) at ./main.go:5:9
 7./main.go:5:9:     from return func literal (return) at ./main.go:5:2
 8./main.go:4:2: x escapes to heap:
 9./main.go:4:2:   flow: {storage for func literal} = &x:
10./main.go:4:2:     from func literal (captured by a closure) at ./main.go:5:9
11./main.go:4:2:     from x (reference) at ./main.go:6:3
12./main.go:4:2: moved to heap: x                   // 变量逃逸
13./main.go:5:9: func literal escapes to heap       // 函数逃逸

登入後複製

至此，我相信读者已经明白为什么闭包能持续持有外部变量的原因了。那么，我们来思考上文中留下的疑问，为什么在x:=1 时会调用 runtime.newobject 函数。

我们将上文中的例子改为如下，即删掉 x++ 代码

 1package main
 2
 3func outer() func() int {
 4    x := 1
 5    return func() int {
 6        return x
 7    }
 8}
 9
10func main() {
11    _ = outer()
12}

登入後複製

此时，x:=1处的汇编代码，将不再调用 runtime.newobject 函数，通过逃逸分析也会发现将x不再逃逸，生成的闭包对象中的x的将是值类型int

1type closure struct {
2    F uintptr 
3    x int      
4}

登入後複製

这其实就是Go编译器做得精妙的地方：当闭包内没有对外部变量造成修改时，Go 编译器会将自由变量的引用传递优化为直接值传递，避免变量逃逸。

总结

函数闭包一点也不神秘，它就是函数和引用环境而组合的实体。在Go中，闭包在底层是一个结构体对象，它包含了函数指针与自由变量。

Go編譯器的逃逸分析機制，會將閉包物件分配到堆中，這樣自由變數就不會隨著函數堆疊的銷毀而消失，它能依附著閉包實體而一直存在。因此，閉包使用的優缺點是很明顯的：閉包能夠避免使用全局變量，轉而維持自由變量長期存儲在內存之中；但是，這種隱式地持有自由變量，在使用不當時，會很容易造成記憶體浪費與外洩。

在實際的專案中，閉包的使用場景並不多。當然，如果你的程式碼中寫了閉包，例如你寫的某回呼函數形成了閉包，那就需要謹慎一些，否則記憶體的使用問題也許會給你帶來麻煩。

以上是Go函數閉包底層實現的詳細內容。更多資訊請關注PHP中文網其他相關文章！