Author's Opinion

The views in this column are those of the author and do not necessarily reflect the views of iTWire.

Have your say and comment below.

Sunday, 13 February 2011 11:21

Three little zillas from Taiwan


There are so many excellent projects around the world based around Linux that many go unnoticed, despite years of development and achieving what they set out to do.

Taiwan's National Centre for High-Performance Computing is one place where a lot of good work allied to Linux goes on - but very little is heard about it.

Perhaps that's because the scientists who do the work are good at their work but not terribly good at pushing what they do.

Unfortunately, in this world of ours, the most mediocre are always the most ambitious. It works to the disadvantage of researchers for whom the work is mostly its own best reward.

The NCHC has at least two projects which deserve some attention - DRBL/Clonezilla and Crawlzilla.

The former is really two projects in one - diskless remote boot in Linux and Clonezilla are clubbed together. Heading it is Steven Shiau, a nuclear engineering graduate who then chose plasma simulation as his research topic.

Right now, Shiau is researching high-performance computing. "NCHC is a non-profit organisation and about 90 percent of the budget comes from the Taiwan government," Shiau told iTWire.

DRBL provides a diskless or systemless environment for client machines. It can work with Debian, Ubuntu, Mandriva (and, one assumes, the recnet Mandriva fork Mageia), Red Hat, Fedora, CentOS, Scientific Linux, and SUSE.

Using distributed hardware resources, DRBL makes it possible for clients to make full use of local hardware. It uses PXE/etherboot, NFS and NIS to provide services to its clients, hence installing GNU/Linux on the client's hard drive is not called for.

A DRBL server can be set up and the clients can boot from it; there is no interaction with the client's hard drive so any operating system already present is left undisturbed.

A standard PC can be used to change a group of client PCs into a GNU/Linux network; all one has to do is to download the DRBL package and run the associated scripts. The process takes about half an hour.

If one wishes to use the hard drive in a client, it can be set up to be used as either swap or data space. All these settings can be made in the centralised boot server and doing so will save a lot of time.

"The DRBL project started in 2003," Shiau said. "We registered it on SourceForge on February 5. The Clonezilla project started in 2004 and was registered on SourceForge on July 27 the same year.

"The DRBL and Clonezilla projects are open to outside contributors. We put them on SourceForge since the projects started, and we have some contributors."
Developers from NCHC Taiwan

Developers Thomas Tsai, Jazz Wang, Steven Shiau (DRBL and Clonezilla project lead), and Ceasar Sun. Shiau is holding the trophy awarded to DRBL in the Public Sector Applications category at the French Trophees du Libre (International Free Software Contest) in December 2007.

Clonezilla, as the name implies, does the same job as done by proprietary application Norton Ghost and the open source package Partition Image. It supports both unicasting and multicasting and takes much less time than any other similar package.

Clonezilla is based on DRBL, Partition Image, ntfsclone and UDPcast and can be used for bare metal backup and recovery. Clonezilla Live is suitable for single machine backup and restoring while Clonezilla Server Edition can clone up to 40 computers simultaneously.

Only the used blocks on the hard drive are saved and restored. At the NCHC, Clonezilla SE was used to clone 41 computers at one go; it took 10 minutes to clone a 5.6GB system image to all 41 using multicasting.

Clonezilla supports ext2, ext3, ext4, reiserfs, reiser4, xfs, jfs, hfs+, FAT and NTFS and can be used to clone GNU/Linux, Windows and Mac OS systems. Unsupported filesystems can be handled too, via a sector-to-sector copy by dd in Clonezilla. LVM2 under GNU/Linux is supported; LVM version 1.0 is not.

Another small package, DRBL-winroll, also developed by the NCHC, can be used to automatically change the hostname, group and SID of a cloned Windows machine.
Taiwan developers from NCHC Taiwan

Crawlzilla project lead Wei-Ju Chen with developers Wen-Chieh Kuo and Shun-Fa.

The other project of note, Crawlzilla, is a cluster-based search engine deployment toolkit. It is headed by Wei-Ju Chen and helps users build search engines for specific websites which cannot be indexed by Google or Yahoo!.

Crawlzilla is based on projects like Nutch, Hadoop and Tomcat; key features include cluster scripts for deployment, text user interface for cluster system management, web user interface for managing crawler URLs and index pools and Chinese lexical support.

Crawlzilla is under active development and the source code is available at SourceForge.


Please join our community here and become a VIP.

Subscribe to ITWIRE UPDATE Newsletter here
JOIN our iTWireTV our YouTube Community here


It's all about Webinars.

Marketing budgets are now focused on Webinars combined with Lead Generation.

If you wish to promote a Webinar we recommend at least a 3 to 4 week campaign prior to your event.

The iTWire campaign will include extensive adverts on our News Site and prominent Newsletter promotion and Promotional News & Editorial. Plus a video interview of the key speaker on iTWire TV which will be used in Promotional Posts on the iTWire Home Page.

Now we are coming out of Lockdown iTWire will be focussed to assisting with your webinatrs and campaigns and assassistance via part payments and extended terms, a Webinar Business Booster Pack and other supportive programs. We can also create your adverts and written content plus coordinate your video interview.

We look forward to discussing your campaign goals with you. Please click the button below.



iTWire TV offers a unique value to the Tech Sector by providing a range of video interviews, news, views and reviews, and also provides the opportunity for vendors to promote your company and your marketing messages.

We work with you to develop the message and conduct the interview or product review in a safe and collaborative way. Unlike other Tech YouTube channels, we create a story around your message and post that on the homepage of ITWire, linking to your message.

In addition, your interview post message can be displayed in up to 7 different post displays on our the site to drive traffic and readers to your video content and downloads. This can be a significant Lead Generation opportunity for your business.

We also provide 3 videos in one recording/sitting if you require so that you have a series of videos to promote to your customers. Your sales team can add your emails to sales collateral and to the footer of their sales and marketing emails.

See the latest in Tech News, Views, Interviews, Reviews, Product Promos and Events. Plus funny videos from our readers and customers.


Sam Varghese

Sam Varghese has been writing for iTWire since 2006, a year after the site came into existence. For nearly a decade thereafter, he wrote mostly about free and open source software, based on his own use of this genre of software. Since May 2016, he has been writing across many areas of technology. He has been a journalist for nearly 40 years in India (Indian Express and Deccan Herald), the UAE (Khaleej Times) and Australia (Daily Commercial News (now defunct) and The Age). His personal blog is titled Irregular Expression.

Share News tips for the iTWire Journalists? Your tip will be anonymous