首先让我们来看一个我朋友希望转换的纯文本文件的例子:
以下为引用的内容:
复制代码 代码如下:
Green for Mars!
John R. Doe
The idea of little green men from Mars, long a staple of science fiction, may soon turn out to be less fantasy and more fact.
Recent samples sent by the latest Mars exploration team indicate a high presence of chlorophyll in the atmosphere. Chlorophyll, you will recall, is what makes plants green. It's quite likely, therefore, that organisms on Mars will have, through continued exposure to the green stuff, developed a greenish tinge on their outer exoskeleton.
An interview with Dr. Rushel Bunter, the head of ASDA's Mars Colonization Project blah blah...
What does this mean for you? Well, it means blah blahblah...
Track follow-ups to this story online at http://www.mars-connect.dom/. To see pictures of the latest samples, log on to http://www.asdamcp.dom/galleries/220/
相当标准的文本:它有一个标题、一个署名和很多段的文字。把这篇文档转换成为HTML真正需要做的是使用HTML的分行和分段标记把原文的布局保留在Web页面上。特殊的标点符号需要被转换成为对应的HTML符号,超链接需要变得可以点击。
下面的PHP代码(列表A)就会完成上面所有的任务:
列表A
让我们来看看它是如何工作的:
复制代码 代码如下:
// set source file name and path
$source = "toi200686.txt";
// read raw text as array
$raw = file($source) or die("Cannot read file");
// retrieve first and second lines (title and author)
$slug = array_shift($raw);
$byline = array_shift($raw);
// join remaining data into string
$data = join('', $raw);
// replace special characters with HTML entities
// replace line breaks with
$html = nl2br(htmlspecialchars($data));
// replace multiple spaces with single spaces
$html = preg_replace('/ss+/', ' ', $html);
// replace URLs with
elements
$html = preg_replace('/s(w+://)(S+)/', ' ', $html);
// start building output page
// add page header
$output =<<< HEADER
HEADER;
// add page content
$output .= "
$slug
";
$output .= "
By $byline
";
$output .= "
$html
";
// add page footer
$output .=<<< FOOTER
FOOTER;
// display in browser
echo $output;
// AND/OR
// write output to a new .html file
file_put_contents(basename($source, substr($source, strpos($source, '.'))) . ".html", $output) or die("Cannot write file");
?>
The first step is to read the pure ASCII file into a PHP array. This is easily accomplished using the file() function, which converts each line of the file into an element in a numerically indexed array.
Then, the title and author lines (I assume both of these are the first two lines of the file) are extracted from the array using the array_shift() function and placed into separate variables. The remaining members of the array are then concatenated into a string. This string now contains the entire text of the article.
Special symbols like "'", "<" and ">" in the article body are converted into corresponding HTML symbols through the htmlspecialchars() function. In order to preserve the original format of the article, line breaks and paragraphs are converted into HTML
elements using the nl2br() function. Multiple spaces in the middle of the article are compressed into one space through simple string replacement.
The URL in the article body is detected using regular expressions, with elements on both sides. When the page is displayed in a web browser, it converts the URL into a clickable hyperlink.
Then create the output HTML page using standard HTML rules. The title, author and body of the article are all formatted using CSS style rules. Although this script doesn't do that, this is where you can customize the look of the final page. You can add graphic elements, colors, or other flashy content to the template.
Once the HTML page is constructed, it can be sent to the browser or saved as a static file using file_put_contents(). Note that when saving, the original filename will be broken down and a new filename (called filename.html) will be created for the newly created web page. You can then publish the Web page to a Web server, save it to a CD, or edit it further.
Note: When using this script to create and save HTML files to disk, you must ensure that the script has write permissions for the directory where the file is saved.
As you can see, if you have a standard format ASCII plain text data file, you can convert it into a usable Web page fairly quickly using PHP. If you already have a Web site and plan to add new Web pages to it, it is fairly easy to adjust the templates used by the page builder to adapt them to the look of the original Web site.
http://www.bkjia.com/PHPjc/320539.htmlwww.bkjia.comtruehttp: //www.bkjia.com/PHPjc/320539.htmlTechArticleFirst let’s look at an example of a plain text file that my friend wants to convert: The following is the quoted content: Copy The code is as follows: Green for Mars! John R. Doe The idea of little...