Home Backend Development C#.Net Tutorial Use C++ to analyze C++ syntax format

Use C++ to analyze C++ syntax format

Nov 21, 2016 am 11:50 AM
analysis

Foreword

Recently, C++ projects require the use of scripts, which is related to how to bind objects to the script running environment. Because multiple scripting languages ​​are used, existing binding technologies cannot meet the needs. Therefore, we can only seek to parse the C++ header file and then bind according to the description. At first, I found that boost has a set, but boost is too bloated and has certain requirements for the establishment of the development environment after the project. I strive for the simplest project management, and boost is not suitable. Write your own set.

Text

Summary of the syntax format of C++

Except for compiler directives starting with # and functions, all must end with a semicolon

Code blocks must be enclosed in braces/flowers except if, do/while, and for in a single statement Brackets "{}"

There are 8 types of code blocks, namespace, global, class, structure, global function, member function, lambda, unnamed code block within a function

The namespace is the same as the global, only add "name" before the declaration Space::” prefix

The difference between class declaration block and global block is friend declaration, member access rights

Templates support classes and functions

Functions and templates have parameter lists

The declaration ending with a semicolon can be a built-in type (int, double, etc.), classes, template classes, typedef types, function pointers, lambda

functions can have the same name

Member functions of a class that access other members can be declared after the function is defined, unlike global functions It must have been declared, so it is impossible to use a pointer to scan whether the C++ file is legal. If it is a classed enumeration (enum class), it does not need to be visible to the scope where it is declared, otherwise the members must be added to the scope at the same time.

I don’t plan to support it. It is not necessary for the requirements.

Namespace

Template

Function body

I don’t want to support multiple variables separated by commas. I don’t want to support

Type verification

Default parameters

union

Enumeration does not judge name conflicts

lambda

Variable names cannot start with numbers

Inheritance of classes

Function pointers

Principles of development

Not cross-platform yet, only VS, do not use system APIs like this Suitable for modification to cross-platform

Using C++

iter only advances but never retreats

When encountering a syntax error or the end of the file, an exception is thrown

When encountering {, it enters block processing

processing; the ending language unit, the function must be itself After processing; and the previous content, return

The member function definition code is not analyzed, because the first principle cannot be achieved, the member function may refer to other member functions that have not been declared yet

It is not the best performance, but it can This structure is optimized to the extreme

We do not seek complete analysis, but on this framework we can analyze all features of C++11 and higher versions

No comments, English is not good, and Chinese comments are not suitable for globalization

Structure description

variant, method, type, comment, enumeration, enumeration_value inherit from object, they can all belong to the global or class or structure.

document represents a c++ compilation unit.

context represents a context, which is a queue that can be searched upward. After the document completes parsing, the parsed variables, functions, and types are stored.

reader, file reader, a forward char iterator, can be replaced by istreambuf_iterator(ifstream).

Use

main function has _DEBUG macro protection, it is recommended to compile it as a library release.

#include

try

{

auto result = cpp_analysis::analysis("[cpp_header_file]");


 // todo

}

catch (logic_error& e)

{

// todo:

}


Project location

https://github.com/FettLuo/cpp_analysis[open in new tab]

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

How to handle special characters in C language How to handle special characters in C language Apr 03, 2025 pm 03:18 PM

In C language, special characters are processed through escape sequences, such as: \n represents line breaks. \t means tab character. Use escape sequences or character constants to represent special characters, such as char c = '\n'. Note that the backslash needs to be escaped twice. Different platforms and compilers may have different escape sequences, please consult the documentation.

How to use various symbols in C language How to use various symbols in C language Apr 03, 2025 pm 04:48 PM

The usage methods of symbols in C language cover arithmetic, assignment, conditions, logic, bit operators, etc. Arithmetic operators are used for basic mathematical operations, assignment operators are used for assignment and addition, subtraction, multiplication and division assignment, condition operators are used for different operations according to conditions, logical operators are used for logical operations, bit operators are used for bit-level operations, and special constants are used to represent null pointers, end-of-file markers, and non-numeric values.

What is the role of char in C strings What is the role of char in C strings Apr 03, 2025 pm 03:15 PM

In C, the char type is used in strings: 1. Store a single character; 2. Use an array to represent a string and end with a null terminator; 3. Operate through a string operation function; 4. Read or output a string from the keyboard.

The difference between multithreading and asynchronous c# The difference between multithreading and asynchronous c# Apr 03, 2025 pm 02:57 PM

The difference between multithreading and asynchronous is that multithreading executes multiple threads at the same time, while asynchronously performs operations without blocking the current thread. Multithreading is used for compute-intensive tasks, while asynchronously is used for user interaction. The advantage of multi-threading is to improve computing performance, while the advantage of asynchronous is to not block UI threads. Choosing multithreading or asynchronous depends on the nature of the task: Computation-intensive tasks use multithreading, tasks that interact with external resources and need to keep UI responsiveness use asynchronous.

How to use char array in C language How to use char array in C language Apr 03, 2025 pm 03:24 PM

The char array stores character sequences in C language and is declared as char array_name[size]. The access element is passed through the subscript operator, and the element ends with the null terminator '\0', which represents the end point of the string. The C language provides a variety of string manipulation functions, such as strlen(), strcpy(), strcat() and strcmp().

How to convert char in C language How to convert char in C language Apr 03, 2025 pm 03:21 PM

In C language, char type conversion can be directly converted to another type by: casting: using casting characters. Automatic type conversion: When one type of data can accommodate another type of value, the compiler automatically converts it.

The difference between char and wchar_t in C language The difference between char and wchar_t in C language Apr 03, 2025 pm 03:09 PM

In C language, the main difference between char and wchar_t is character encoding: char uses ASCII or extends ASCII, wchar_t uses Unicode; char takes up 1-2 bytes, wchar_t takes up 2-4 bytes; char is suitable for English text, wchar_t is suitable for multilingual text; char is widely supported, wchar_t depends on whether the compiler and operating system support Unicode; char is limited in character range, wchar_t has a larger character range, and special functions are used for arithmetic operations.

What is the function of C language sum? What is the function of C language sum? Apr 03, 2025 pm 02:21 PM

There is no built-in sum function in C language, so it needs to be written by yourself. Sum can be achieved by traversing the array and accumulating elements: Loop version: Sum is calculated using for loop and array length. Pointer version: Use pointers to point to array elements, and efficient summing is achieved through self-increment pointers. Dynamically allocate array version: Dynamically allocate arrays and manage memory yourself, ensuring that allocated memory is freed to prevent memory leaks.

See all articles