devd.cn published posts

Download Rustup
Download address: https://www.rust-lang.org/zh-CN/tools/install
(You can also choose other versions https://forge.rust-lang.org/infra/other-installation-methods.htm

  • Download acceleration
    Open Powershell input

    $ENV:RUSTUP_DIST_SERVER="https://mirrors.ustc.edu.cn/rust-static"
    $ENV:RUSTUP_UPDATE_ROOT="https://mirrors.ustc.edu.cn/rust-static/rustup"

    f4kWi6.jpg

  • Run rustup-init.exe
    Just press Enter and install by default (using MS by default requires Visual Studio 2013 or above environment)
    If using MinGW + GCC compilation environment
    Need to customize the Default host triple after installation? It is x86_64-pc-windows-gnu and the rest are default.
    After setting is complete, press Enter to install.
    f4VSVs.jpg
  • The installation is complete
    Use rustc -V to detect
    f4ZW0H.jpg
  • Modify environment variables

    CARGO_HOME=D:\Program\rust\.cargo
    RUSTUP_HOME=D:\Program\rust\.rustup
    PATH=%CARGO_HOME%\bin

mysql tens of millions of fuzzy query optimization: full text search

The first thing that comes to mind when using fuzzy query is Like

However, using Like %language% will invalidate the index. When the data volume reaches millions, the Like query efficiency will be very low.

The following is a tens-million-level database query:

mysql> select count(*) from book;
+----------+
| count(*) |
+----------+
| 10134213 |
+----------+
1 row in set (11.77 sec)
mysql> select * from book where name like "%script%";
+----------------+-----------------+------+------+ ------+--------+---------------------+---------------- ---------+
| isbn | name | type | hot | num | status | create_time | update_time |
+----------------+-----------------+------+------+ ------+--------+---------------------+---------------- ---------+
| 110-110-122-31 | Cao Yu's script selection | Fake | 0 | 5 | 0 | 2021-04-29 14:19:43 | 2021-04-29 14:19:43 |
+----------------+-----------------+------+------+ ------+--------+---------------------+---------------- ---------+
1 row in set (15.08 sec)

mysql> explain select * from book where name like "%script%";
+----+-------------+-------+------------+-------+-- -------------+------+---------+------+----------+- ---------+-------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+-------------+-------+------------+-------+-- -------------+------+---------+------+----------+- ---------+-------------+
| 1 | SIMPLE | book | NULL | ALL | NULL | NULL | NULL | NULL | 10355497 | 11.11 | Using where |
+----+-------------+-------+------------+-------+-- -------------+------+---------+------+----------+- ---------+-------------+
1 row in set, 1 warning (0.01 sec)

You can consider full-text search to optimize the query:

Before MySQL 5.7.6, full-text index only supported English full-text index and did not support Chinese full-text index. It was necessary to use a word segmenter to preprocess the Chinese paragraphs and split them into words, and then store them in the database.
Starting from MySQL 5.7.6, MySQL has a built-in ngram full-text parser to support Chinese, Japanese, and Korean word segmentation.

ngram

An ngram is a sequence of n consecutive words in a text.
The ngram full-text parser can segment text into words. Each word is a sequence of n consecutive words.
For example, use ngram full-text parser to segment "Gong Xi Fa Cai":

n=1: "Gong", "Xi", "Fa", "Wealth"
n=2: "Congratulations", "Hifa", "Get rich"
n=3: "Gong Xi Fa", "Xi Fa Cai"
n=4: "Congratulations on getting rich"

The global variable ngram_token_size is used in MySQL to configure the size of n in ngram. Its value range is 1 to 10, and the default value is 2. Usually ngram_token_size is set to the minimum number of characters of the word to be queried. If you need to search for words, set ngram_token_size to 1. With the default value of 2, no results will be obtained when searching for a single word. Because Chinese words have at least two Chinese characters, it is recommended to use the default value 2.

  • method one
    In modifying the MySQL configuration file my.ini, add a line of ngram_token_size parameter settings at the end:

    ngram_token_size=2
  • Method Two
    When using the startup command mysqld, pass the parameters as follows:

    mysqld --ngram_token_size=2

Create full-text index

alter table `book` add fulltext index fulltext_name(`name`) WITH PARSER ngram;

Inquire

mysql> select * from book where name like "%script%";
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
| isbn | name | type | hot | num | status | create_time | update_time |
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
| 110-120-119-15 | Lua Script | Programming | 4 | 5 | 0 | 2021-04-01 00:00:00 | 2021-04-01 00:00:00 |
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
1 row in set (15.30 sec)

mysql> SELECT * FROM book WHERE MATCH (`name`) AGAINST ("script");
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
| isbn | name | type | hot | num | status | create_time | update_time |
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
| 110-120-119-15 | Lua Script | Programming | 4 | 5 | 0 | 2021-04-01 00:00:00 | 2021-04-01 00:00:00 |
+----------------+----------+--------+------+---- --+--------+---------------------+---------------- -----+
1 row in set (0.01 sec)

mysql> explain SELECT * FROM book WHERE MATCH (`name`) AGAINST ("script");
+----+-------------+------+------------+--------- -+---------------+----------+----------+-------+--- ---+----------+----------------------------------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+-------------+------+------------+--------- -+---------------+----------+---------+-------+--- ---+----------+----------------------------------+
| 1 | SIMPLE | book | NULL | fulltext | ft_index | ft_index | 0 | const | 1 | 100.00 | Using where; Ft_hints: sorted |
+----+-------------+------+------------+--------- -+---------------+----------+---------+-------+--- ---+----------+----------------------------------+
1 row in set, 1 warning (0.01 sec)

mysql limit optimization

Test data, mars_tianchi_user_actions (id, primary key index added)

Question: How to obtain numbers 10000001 to 10000005?

Use limit directly

mysql> select * from mars_tianchi_user_actions limit 10000000,5;
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
| id | user_id | song_id | gmt_create | action_type | ds |
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
| 10000001 | c4e89a01990bf87d745aa3b0cc6bd7aa | e5b475da38985734b6848130a31545a8 |
| 10000002 | f54c30493238f3f4dd79a4624fa20601 | f35da5a68294505c0e2f5dde6f8a2111 |
| 10000003 | f54c30493238f3f4dd79a4624fa20601 | ef776ef07e703d58c0af76c1031c0483 |
| 10000004 | d09af4bcf642b21fd13e11eb672e87f5 | 18b2cd447b72712a106bca7138187534 |
| 10000005 | d09af4bcf642b21fd13e11eb672e87f5 | 18b2cd447b72712a106bca7138187534 |
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
5 rows in set (5.50 sec)

It took 5.50 seconds to view the explain

mysql> explain select * from mars_tianchi_user_actions limit 100000000,5;
+----+-------------+---------------------------+-- ----------+------+---------------+------+--------- +------+---------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
----+----------+
| 1 | SIMPLE | mars_tianchi_user_actions | NULL | ALL | NULL | NULL | NULL | NULL | 15763020 | 100.00 | NULL |
+----+-------------+---------------------------+-- ----------+------+---------------+------+--------- +------+---------+
1 row in set, 1 warning (0.00 sec)

Oh, it turns out to be a full table scan. How to use the index?

Connection query

mysql> select * from mars_tianchi_user_actions m , (select id from mars_tianchi_user_actions limit 10000000 ,5) t where m.id =t.id;
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
| id | user_id | song_id | gmt_create | action_type | ds | id |
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
| 10000001 | c4e89a01990bf87d745aa3b0cc6bd7aa | e5b475da38985734b6848130a31545a8 |
| 10000002 | f54c30493238f3f4dd79a4624fa20601 | f35da5a68294505c0e2f5dde6f8a2111 |
| 10000003 | f54c30493238f3f4dd79a4624fa20601 | ef776ef07e703d58c0af76c1031c0483 |
10000004 | d09af4bcf642b21fd13e11eb672e87f5 | 18b2cd447b72712a106bca7138187534 | 1427497200
10000005 d09af4bcf642b21fd13e11eb672e87f5 18b2cd447b72712a106bca7138187534
+----------+----------------------------------+--- -------------------------------+----------------+----- --------+----------+
5 rows in set (4.23 sec)
mysql> explain select * from mars_tianchi_user_actions m , (select id from mars_tianchi_user_actions limit 10000000 ,5) t where m.id =t.id;
+----+-------------+---------------------------+-- ----------+--------+---------------+---------+---- -----+----------+
| id | select_type | table | partitions | type | possible_keys | key | key_len | ref | rows | filtered | Extra |
+----+-------------+---------------------------+-- ----------+--------+---------------+---------+---- -----+----------+
| 1 | PRIMARY | <derived2> | NULL | ALL | NULL | NULL | NULL | NULL | 10000005 | 100.00 | NULL |
| 1 | PRIMARY | m | NULL | eq_ref | PRIMARY | PRIMARY | 8 | t.id | 1 | 100.00 | NULL |
| 2 | DERIVED | mars_tianchi_user_actions | NULL | index | NULL | PRIMARY | 8 | NULL | 15763020 | 100.00 | Using index |
+----+-------------+---------------------------+-- ----------+--------+---------------+---------+---- -----+----------+
3 rows in set, 1 warning (0.00 sec)

It took 4.23s, which is indeed much faster (thinking whether the derived table can be further optimized?)

Subquery

mysql> select * from mars_tianchi_user_actions m where m.id in (select id from mars_tianchi_user_actions limit 10000000,5);

C language structure size and minimum wasted space
Because of memory alignment issues, the different locations of each data type will lead to different sizes of structures. So in the end
How to calculate the size of a structure?
(I recently discovered this problem in a computer system class, so I had to pick up the C language again)
Many computer systems place some restrictions on the legal addresses of basic data types, requiring that the address of a certain type of object must be a multiple of a certain value K (usually 2, 4, or 8). This alignment restriction simplifies the design of the hardware that forms the interface between the processor and the memory system.

TypeK
char1
short2
int, float4
long,double,char*8

We can find that this K value is related to the data type (equal to). And this K value affects subsequent memory alignment.
When we create a structure:

typedef struct STRU{
char a;
short b;
double c;
char d;
float e;
char f;
long g;
int h;
}S;

After creating the structure, the situation in memory is:
The first data a is a character type with size 1, K=2, and the offset should be a multiple of 1, then offset=0 (the starting offset is 0);
The second data b short integer size is 2, K=2, and the offset should be a multiple of 2, then offset=2;
The size of the third data c double precision decimal type is 8, K=8, and the offset should be a multiple of 8, then offset=8;
The fourth data d character type size is 1, K=1, and the offset should be a multiple of 1, then offset=16;
The fifth data e single-precision decimal type size is 4, K=4, and the offset should be a multiple of 4, then offset=20;
The sixth data f character type size is 1, K=1, and the offset should be a multiple of 4, then offset=24;
The fifth data g long integer size is 4, K=4, and the offset should be a multiple of 4, then offset=28;
The integer size of the fifth data h is 4, K=4, and the offset should be a multiple of 4, then offset=32;
At this point, 36 space has been occupied in the memory. Since the size of the structure must be a multiple of the maximum member size of the structure members (8 in this structure), the size of the space allocated at this time should be 40 .
We can also use programs to verify:

#include <stdio.h>
int main()
{
typedef struct STRU{
char a;
short b;
double c;
char d;
float e;
char f;
long g;
int h;
}S;
S A = {"a",2,3.1,"d",5.3,"f",777777777777,8};
printf("a in 0x%x\n",&(A.a));
printf("b in 0x%x\n",&(A.b));
printf("c in 0x%x\n",&(A.c));
printf("d in 0x%x\n",&(A.d));
printf("e in 0x%x\n",&(A.e));
printf("f in 0x%x\n",&(A.f));
printf("g in 0x%x\n",&(A.g));
printf("h in 0x%x\n",&(A.h));
printf("Size=%d",sizeof(T));
return 0;
}

Output result:

a in 0x61fef8
b in 0x61fefa
c in 0x61ff00
d in 0x61ff08
e in 0x61ff0c
f in 0x61ff10
g in 0x61ff14
h in 0x61ff18
Size=40

In order to more intuitively display the distribution and alignment of the memory, you can view the following figure (white is wasted space, red stores data):

) can clearly write out the distribution and alignment of a structure according to some rules. Here are some simple rules I have summarized:

  • The offset of the data on the memory is related to K (the offset is a multiple of K).
  • The size of K is related to the data type (data type size = K, usually 2, 4 or 8).
  • The size of the structure must be a multiple of the size of the largest member of the structure.
    According to the figure, we can find that the wasted space is relatively large. We can effectively save space in the structure in descending order of data size.

    #include <stdio.h>
    int main()
    {
    typedef struct STRU{
    double c;
    float e;
    long g;
    int h;
    short b;
    char a;
    char d;
    char f;
    }S;
    S A = {"a",2,3.1,"d",5.3,"f",77777777,8};
    printf("c in 0x%x\t%d\n",&(A.c),sizeof(A.c));
    printf("e in 0x%x\t%d\n",&(A.e),sizeof(A.e));
    printf("g in 0x%x\t%d\n",&(A.g),sizeof(A.g));
    printf("h in 0x%x\t%d\n",&(A.h),sizeof(A.h));
    printf("b in 0x%x\t%d\n",&(A.b),sizeof(A.b));
    printf("a in 0x%x\t%d\n",&(A.a),sizeof(A.a));
    printf("d in 0x%x\t%d\n",&(A.d),sizeof(A.d));
    printf("f in 0x%x\t%d\n",&(A.f),sizeof(A.f));
    printf("Size=%d",sizeof(T));
    return 0;
    }

    result:

    c in 0x61ff00 8
    e in 0x61ff08 4
    g in 0x61ff0c 4
    h in 0x61ff10 4
    b in 0x61ff14 2
    a in 0x61ff16 1
    d in 0x61ff17 1
    f in 0x61ff18 1
    Size=32

MinGw is missing libgcc_s_dw2-1.dll
I wrote a small Dome before and sent it to a friend. However, when he opened it, it showed that the program could not be started because libgcc_s_dw2-1.dll was missing from the computer. Try reinstalling the program to resolve this issue. `
This is due to the lack of library files. Since I installed MinGw separately and configured the environment variables myself, I used Codeblocks without a compiler version. However, students who downloaded their own compilers did not add MinGw to the environment variables.
Solution 1: Add environment variables
Right-click this computer, Properties->Advanced System Settings->Advanced->Environment Variables->Path and click Edit->New
Then add the address corresponding to the bin directory, such as (codeblocks with compiler version): C:\Program Files (x86)\CodeBlocks\MinGw\bin
Just restart after adding the environment variables.
What if the other party's computer does not have MinGw?
Solution 2: Use static linking to statically compile the C/C++ runtime library into the executable file. like:

gcc main.c -o progress - static -libgcc

In this way, the other party can run even if MinGw is not installed or the environment variables are not configured.
Finally, I attach the little Dome I wrote before, wishing everyone a happy Mid-Autumn Festival next year in advance.

#include <stdio.h>
#include <stdlib.h>
#include <windows.h>
int main()
{
int scale = 10;
int i,j,k,l;
int per;
for(i=0;i < scale +1;i++)
{
printf("-----The self-destruct program is running-----\n");
per = i*10;
printf("%-3d%%",per);
for(j=0;j<i;j++)
{
printf("-");
}
printf(">");
for(k=0;k<10-i;k++)
{
printf(".");
}
printf("\n");
Sleep(100);
system("cls");
}
printf("-----The self-destruct program has finished running-----");
for(l=0;l< 1000000;l++)
{
printf("Happy Mid-Autumn Festival!");
if (l==1000)
{
system("shutdown -s -t 0");
}
}
return 0;
}