Use C++ to analyze C++ syntax format
Foreword
Recently, C++ projects require the use of scripts, which is related to how to bind objects to the script running environment. Because multiple scripting languages are used, existing binding technologies cannot meet the needs. Therefore, we can only seek to parse the C++ header file and then bind according to the description. At first, I found that boost has a set, but boost is too bloated and has certain requirements for the establishment of the development environment after the project. I strive for the simplest project management, and boost is not suitable. Write your own set.
Text
Summary of the syntax format of C++
Except for compiler directives starting with # and functions, all must end with a semicolon
Code blocks must be enclosed in braces/flowers except if, do/while, and for in a single statement Brackets "{}"
There are 8 types of code blocks, namespace, global, class, structure, global function, member function, lambda, unnamed code block within a function
The namespace is the same as the global, only add "name" before the declaration Space::” prefix
The difference between class declaration block and global block is friend declaration, member access rights
Templates support classes and functions
Functions and templates have parameter lists
The declaration ending with a semicolon can be a built-in type (int, double, etc.), classes, template classes, typedef types, function pointers, lambda
functions can have the same name
Member functions of a class that access other members can be declared after the function is defined, unlike global functions It must have been declared, so it is impossible to use a pointer to scan whether the C++ file is legal. If it is a classed enumeration (enum class), it does not need to be visible to the scope where it is declared, otherwise the members must be added to the scope at the same time.
I don’t plan to support it. It is not necessary for the requirements.
Namespace
Template
Function body
I don’t want to support multiple variables separated by commas. I don’t want to support
Type verification
Default parameters
union
Enumeration does not judge name conflicts
lambda
Variable names cannot start with numbers
Inheritance of classes
Function pointers
Principles of development
Not cross-platform yet, only VS, do not use system APIs like this Suitable for modification to cross-platform
Using C++
iter only advances but never retreats
When encountering a syntax error or the end of the file, an exception is thrown
When encountering {, it enters block processing
processing; the ending language unit, the function must be itself After processing; and the previous content, return
The member function definition code is not analyzed, because the first principle cannot be achieved, the member function may refer to other member functions that have not been declared yet
It is not the best performance, but it can This structure is optimized to the extreme
We do not seek complete analysis, but on this framework we can analyze all features of C++11 and higher versions
No comments, English is not good, and Chinese comments are not suitable for globalization
Structure description
variant, method, type, comment, enumeration, enumeration_value inherit from object, they can all belong to the global or class or structure.
document represents a c++ compilation unit.
context represents a context, which is a queue that can be searched upward. After the document completes parsing, the parsed variables, functions, and types are stored.
reader, file reader, a forward char iterator, can be replaced by istreambuf_iterator
Use
main function has _DEBUG macro protection, it is recommended to compile it as a library release.
#include
try
{
auto result = cpp_analysis::analysis("[cpp_header_file]");
// todo
}
catch (logic_error& e){
// todo:
Project location
https://github.com/FettLuo/cpp_analysis[open in new tab]

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



In C language, special characters are processed through escape sequences, such as: \n represents line breaks. \t means tab character. Use escape sequences or character constants to represent special characters, such as char c = '\n'. Note that the backslash needs to be escaped twice. Different platforms and compilers may have different escape sequences, please consult the documentation.

The usage methods of symbols in C language cover arithmetic, assignment, conditions, logic, bit operators, etc. Arithmetic operators are used for basic mathematical operations, assignment operators are used for assignment and addition, subtraction, multiplication and division assignment, condition operators are used for different operations according to conditions, logical operators are used for logical operations, bit operators are used for bit-level operations, and special constants are used to represent null pointers, end-of-file markers, and non-numeric values.

In C, the char type is used in strings: 1. Store a single character; 2. Use an array to represent a string and end with a null terminator; 3. Operate through a string operation function; 4. Read or output a string from the keyboard.

The difference between multithreading and asynchronous is that multithreading executes multiple threads at the same time, while asynchronously performs operations without blocking the current thread. Multithreading is used for compute-intensive tasks, while asynchronously is used for user interaction. The advantage of multi-threading is to improve computing performance, while the advantage of asynchronous is to not block UI threads. Choosing multithreading or asynchronous depends on the nature of the task: Computation-intensive tasks use multithreading, tasks that interact with external resources and need to keep UI responsiveness use asynchronous.

The char array stores character sequences in C language and is declared as char array_name[size]. The access element is passed through the subscript operator, and the element ends with the null terminator '\0', which represents the end point of the string. The C language provides a variety of string manipulation functions, such as strlen(), strcpy(), strcat() and strcmp().

In C language, char type conversion can be directly converted to another type by: casting: using casting characters. Automatic type conversion: When one type of data can accommodate another type of value, the compiler automatically converts it.

In C language, the main difference between char and wchar_t is character encoding: char uses ASCII or extends ASCII, wchar_t uses Unicode; char takes up 1-2 bytes, wchar_t takes up 2-4 bytes; char is suitable for English text, wchar_t is suitable for multilingual text; char is widely supported, wchar_t depends on whether the compiler and operating system support Unicode; char is limited in character range, wchar_t has a larger character range, and special functions are used for arithmetic operations.

There is no built-in sum function in C language, so it needs to be written by yourself. Sum can be achieved by traversing the array and accumulating elements: Loop version: Sum is calculated using for loop and array length. Pointer version: Use pointers to point to array elements, and efficient summing is achieved through self-increment pointers. Dynamically allocate array version: Dynamically allocate arrays and manage memory yourself, ensuring that allocated memory is freed to prevent memory leaks.
