Solution to Linux socket garbled code: 1. Open the terminal command window; 2. Use the iconv command to convert the Linux file character set. The conversion command is such as "iconv -f utf-8 -t gb2312 /server_test/reports/ software_.txt > /server_test/reports/software_asserts.txt".
#The operating environment of this tutorial: linux5.9.8 system, Dell G3 computer.
What should I do if the Linux socket is garbled?
linux, windows, socket garbled problem
Problem description:
There is a ftp client written by myself in linux, and there is a built-in ftp client in windows. The ftp server built by the software, when I create a directory containing Chinese characters or a file name containing Chinese characters on the server through the program, the Chinese name is garbled
Problem analysis:
There is no doubt that It is caused by an encoding problem, so how to confirm which encoding it should be? Baidu later discovered that ubuntu uses utf-8 by default, while windows uses gbk by default, so just make sure that content containing Chinese paths is converted to gbk when it is sent out
Solution:
Conversion between utf8 and gbk:
Linux file character set conversion (utf8-gb2312)
1, command line
When performing encoding conversion on LINUX, you can use the iconv command to achieve this. This is for files, that is, converting the specified file from one encoding to another encoding.
iconv command usage is as follows:
iconv [选项...] [文件...]
Input/output format specification:
-f, --from-code=名称 原始文本编码 -t, --to-code=名称 输出编码
List All known character sets -l, --list
Output control:
-c 从输出中忽略无效的字符 -o, --output=FILE 输出文件 -s, --silent 关闭警告 --verbose 打印进度信息
Example:
iconv -f utf-8 -t gb2312 /server_test/reports/software_.txt > /server_test/reports/software_asserts.txt
Please download the libiconv library on the embedded board for the following required library files
2, function application
1.Header file
#include <iconv.h>
2. Function
(1) iconv_t iconv_open(const char *tocode, const char *fromcode);
This function explains which two encodings will be converted, tocode is the target encoding, fromcode is the original encoding, and this function returns a conversion handle for use by the following two functions.
(2) size_t iconv(iconv_t cd,char **inbuf,size_t *inbytesleft,char **outbuf,size_t *outbytesleft);
This function reads characters from inbuf and converts them Then output to outbuf, inbytesleft is used to record the number of characters that have not been converted, and outbytesleft is used to record the remaining space of the output buffer.
(3) int iconv_close(iconv_t cd);
This function is used to close the conversion handle and release resources.
3. Example: Conversion sample program implemented in C language
#include "stdio.h" #include "string.h" #include "iconv.h" #include "stdlib.h" static int charset_convert( const char *from_charset, const char *to_charset, char *in_buf, size_t in_left, char *out_buf, size_t out_left) { iconv_t icd = (iconv_t)-1; size_t sRet = -1; char *pIn = in_buf; char *pOut = out_buf; size_t outLen = out_left; if (NULL == from_charset || NULL == to_charset || NULL == in_buf || 0 >= in_left || NULL == out_buf || 0 >= out_left) { return -1; } icd = iconv_open(to_charset, from_charset); if ((iconv_t)-1 == icd) { return -1; } sRet = iconv(icd, &pIn, &in_left, &pOut, &out_left); if (( size_t )-1 == sRet) { iconv_close(icd); return -1; } out_buf[outLen - out_left] = 0; iconv_close(icd); return ( int )(outLen - out_left); } static int charset_convert_UTF8_TO_GB2312( char *in_buf, size_t in_left, char *out_buf, size_t out_left) { return charset_convert( "UTF-8" , "GB2312" , in_buf, in_left, out_buf, out_left); } static int charset_convert_GB2312_TO_UTF8( char *in_buf, size_t in_left, char *out_buf, size_t out_left) { return charset_convert( "GB2312-8" , "UTF-8" , in_buf, in_left, out_buf, out_left); } int main() { char *pIn = "hello 中国人" ; char *pOut = NULL; size_t outLen = 0; size_t inLen = 0; int iRet = -1; inLen = strlen (pIn); outLen = inLen * 3; pOut = ( char *) malloc ( sizeof ( char ) * outLen); if (NULL == pOut) { return -1; } memset (pOut, 0, sizeof ( char ) * outLen); iRet = charset_convert_UTF8_TO_GB2312(pIn, ( size_t )inLen, pOut, ( size_t )inLen); if (-1 == iRet) { return -1; } printf ( "out = %s;\touLen = %d\n" , pOut, iRet); free (pOut); pOut = NULL; return 0; }
This implementation can be cross-platform, and I personally think it is better~
ps: The ftp client I made before is On Windows, because Party A said it wanted to transplant it, it switched to Windows, and then the above problem occurred. Although I compared the Chinese solutions that appeared in the project, I at least accumulated experience in solving the problem (no Chinese, You can use Pinyin)
Related recommendations: "Linux Video Tutorial"
The above is the detailed content of What to do if linux socket is garbled. For more information, please follow other related articles on the PHP Chinese website!