How Can We Accurately Capture Function Exit Times for Performance Profiling on Embedded Systems?

Dec 18, 2024 11:35 AM

Capturing Function Exit Time with __gnu_mcount_nc

One way to profile code on an embedded platform is to implement an entry hook such as __gnu_mcount_nc that records only the stack frame and the current cycle count each time a function is entered. This already yields useful data: a caller/callee graph and a list of the most frequently called functions. However, the hook fires only on entry. Without a record of when each function exits, the time spent inside each function body cannot be computed.
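The entry-only recording described above can be sketched as follows. This is a minimal illustration, not the original implementation: the names EntryRecord and EntryLog are hypothetical, and the assumption is that a real entry hook would pass in the caller address, the callee address, and a cycle counter value read from the hardware.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <vector>

// One record per function entry (hypothetical layout).
struct EntryRecord {
    std::uintptr_t caller;  // address inside the calling function
    std::uintptr_t callee;  // address of the function being entered
    std::uint32_t cycles;   // cycle counter value at entry
};

// Fixed-capacity log; records are dropped once it is full.
class EntryLog {
public:
    explicit EntryLog(std::size_t capacity) : capacity_(capacity) {
        assert(capacity > 0);
        records_.reserve(capacity);
    }

    // Assumed to be called from the entry hook on every function entry.
    void record(std::uintptr_t caller, std::uintptr_t callee, std::uint32_t cycles) {
        if (records_.size() < capacity_) {
            records_.push_back(EntryRecord{caller, callee, cycles});
        }
    }

    std::size_t size() const { return records_.size(); }

    // Number of recorded calls along one caller -> callee arc.
    unsigned callCount(std::uintptr_t caller, std::uintptr_t callee) const {
        unsigned n = 0;
        for (const EntryRecord& r : records_) {
            if (r.caller == caller && r.callee == callee) ++n;
        }
        return n;
    }

    // How often each function was entered, keyed by callee address.
    std::map<std::uintptr_t, unsigned> entriesPerFunction() const {
        std::map<std::uintptr_t, unsigned> counts;
        for (const EntryRecord& r : records_) ++counts[r.callee];
        return counts;
    }

private:
    std::size_t capacity_;
    std::vector<EntryRecord> records_;
};
```

A log like this answers "who calls whom, and how often", but the cycle values mark entries only. The gap between two consecutive records says nothing about which function the time was spent in, which is the limitation described above.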

GNU Profiling Tool Approach

GNU profiling tools such as gprof do not time function entry and exit either. Instead, gprof measures the self-time of each function by counting the program-counter (PC) samples that land inside it. That self-time is then distributed among the function's callers in proportion to the function-to-function call counts.
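The attribution step can be illustrated with a small sketch. It is a simplified model of proportional distribution, not gprof's actual code: the function names and the string-keyed maps are assumptions made for illustration, and only a single caller/callee level is handled.

```cpp
#include <cassert>
#include <map>
#include <string>

// Self-time: PC samples that landed in the function, times the sampling period.
double selfTimeSeconds(unsigned pcSamples, double samplePeriodSeconds) {
    assert(samplePeriodSeconds > 0.0);
    return pcSamples * samplePeriodSeconds;
}

// Split a callee's time among its callers in proportion to how many
// times each caller invoked it. Returns an empty map if there were no calls.
std::map<std::string, double> distributeToCallers(
        double calleeSeconds,
        const std::map<std::string, unsigned>& callsFromCaller) {
    unsigned long totalCalls = 0;
    for (const auto& kv : callsFromCaller) totalCalls += kv.second;

    std::map<std::string, double> share;
    if (totalCalls == 0) return share;

    for (const auto& kv : callsFromCaller) {
        share[kv.first] = calleeSeconds * kv.second / totalCalls;
    }
    return share;
}
```

Note the built-in assumption: every call is treated as costing the same. A caller that makes three of four calls is charged three quarters of the callee's time, even if its calls were the cheap ones.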

Advantages of Stack Sampling

Stack sampling records the whole call stack at each sample rather than only the PC. Compared to PC sampling, it has several advantages:

  • Accuracy: Every function on the stack at sample time is charged directly, which removes the uncertainty around short function calls and library routines not compiled with -pg.
  • Efficiency: Each stack sample costs more to capture than a PC sample, but fewer samples are needed for a useful profile.
  • Robustness: Stack sampling is not distorted by recursion and works in multithreaded and multicore environments.
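As a rough sketch of how stack samples turn into numbers, the function below estimates the fraction of time each function is on the stack. The StackSample alias and the function name are hypothetical, and the samples are assumed to have already been symbolized into function names. Each function is counted at most once per sample, which is why recursion does not inflate the result.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <vector>

// One sample: the call stack at that instant, outermost caller first.
using StackSample = std::vector<std::string>;

// Fraction of samples in which each function appears anywhere on the stack.
std::map<std::string, double> inclusiveFraction(
        const std::vector<StackSample>& samples) {
    std::map<std::string, std::size_t> hits;
    for (const StackSample& stack : samples) {
        // Deduplicate so a recursive function counts once per sample.
        std::set<std::string> onStack(stack.begin(), stack.end());
        for (const std::string& fn : onStack) ++hits[fn];
    }

    std::map<std::string, double> fraction;
    if (samples.empty()) return fraction;

    for (const auto& kv : hits) {
        assert(kv.second <= samples.size());  // at most one hit per sample
        fraction[kv.first] = static_cast<double>(kv.second) / samples.size();
    }
    return fraction;
}
```

With only a handful of samples the fractions are coarse, but a function that shows up in half of them is clearly worth a look, and the samples it appears in show which call chains led to it.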

Alternatives to Call-Graphs and Hot-Spots

Call graphs and hot-spot lists provide some insight, but they may not expose hidden performance problems. A more effective approach is to examine a few raw stack samples chosen at random, looking at which functions are consuming excessive time and why they are being called. Reading the samples directly gives a deeper understanding of the code structure and points to concrete areas for optimization.



