Weibo and Tieba are both high-concurrency websites built with PHP, one of the best in China. How did they come up with this hot topic ranking?
Currently I just want to use the Chinese word segmentation library to segment and collect statistics on the entire site content. However, such high-concurrency websites produce hundreds of millions of data every day. How can we solve the efficiency problem by using word segmentation? Or what other techniques did they use?