How to set replication factors for HDFS directories?

Is it possible to set the replication factor for a specific directory in HDFS to a value different from the default replication factor? This should apply to the existing files in that directory and also to new files created in it. This can simplify administration. We can set the replication factor of /tmp/ to…

How to get files without certain strings in their file names (reverse of *string*) on Linux?

To get files with a certain string in their file names, it is quite straightforward: ls *string*. However, how to do the reverse: how to get files without certain strings in their file names on Linux? You can get a list of file names by a combination of find and xargs as follows: find …
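
As a rough illustration of the idea (not necessarily the command elided above), find can negate a name pattern with `!`. The sketch below assumes GNU find on Linux; the function name `list_without` and its two arguments are hypothetical and introduced only for this example.

```shell
# list_without DIR STRING
# Print the regular files directly inside DIR whose names do NOT contain STRING.
# Hypothetical helper for illustration only.
list_without() {
    # -maxdepth 1 : do not descend into subdirectories (GNU/BSD find option)
    # -type f     : regular files only
    # ! -name     : negate the name match, i.e. the reverse of *STRING*
    # sort        : make the output order stable
    find "$1" -maxdepth 1 -type f ! -name "*$2*" | sort
}
```

For example, `list_without . string` lists the files in the current directory that `ls *string*` would skip. Note that glob characters inside STRING are interpreted by find as part of the pattern.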

How to enable RPM Fusion for CentOS 6.6?

How to enable RPM Fusion for CentOS 6.6? Enable RPM Fusion on RHEL 6 or a compatible distribution like CentOS: su -c 'yum localinstall --nogpgcheck http://download1.rpmfusion.org/free/el/updates/6/i386/rpmfusion-free-release-6-1.noarch.rpm http://download1.rpmfusion.org/nonfree/el/updates/6/i386/rpmfusion-nonfree-release-6-1.noarch.rpm' It will also install EPEL (https://fedoraproject.org/wiki/EPEL). If it fails to install EPEL automatically, you will need to install it manually. Reference: http://rpmfusion.org/Configuration

How to install vbetool on CentOS 6.6?

I found that CentOS 6.6 does not ship vbetool in its default repositories, EPEL or RPM Fusion. How to install vbetool on CentOS 6.6? First, download the source package from http://www.codon.org.uk/~mjg59/vbetool/ and unpack it. Second, install the needed packages: # yum install pciutils-devel pciutils-devel-static libx86-devel During the build, it will try to find libpci.a in a different…

How to convert .pptx slides to .jpg or .png images from the command line on Linux?

How to convert .pptx slides to .jpg or .png images from the command line on Linux? The following method works best for me. First, convert the .pptx file to .pdf using libreoffice: libreoffice --headless --convert-to pdf file.pptx The --headless option makes libreoffice run in batch mode without starting the GUI. The pdf file will be named file.pdf by…

How to convert between Simplified Chinese and Traditional Chinese characters in text files on Linux?

How to convert between Simplified Chinese and Traditional Chinese characters in text files on Linux from the command line? You can use opencc to convert between Traditional Chinese and Simplified Chinese: https://github.com/BYVoid/OpenCC For example, to convert a Simplified Chinese file sc.txt to Traditional Chinese: opencc -i sc.txt -o tc.txt -c zhs2zht.ini The authors also provide…

How to merge git branches quickly and correctly

Suppose I have the following branches: harryxiyou@common_vm ~/forest/sqle/sqle/scripts $ git branch * dev-harry master rc After making some changes on the dev-harry branch, I wanted to merge dev-harry into the rc branch. 1. git checkout rc 2. git merge dev-harry References: http://stackoverflow.com/questions/24147169/merge-two-remote-branches-in-git https://www.atlassian.com/git/tutorials/using-branches http://git-scm.com/book/en/v2/Git-Branching-Basic-Branching-and-Merging I usually add --no-ff during git merge to force git to add a commit for the…
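
The two steps, together with the --no-ff option, could be wrapped as in the sketch below. This is only an illustration under assumptions: the function name `merge_no_ff` and the merge message wording are hypothetical, and the branch names are passed in as arguments rather than hard-coded.

```shell
# merge_no_ff TARGET SOURCE
# Check out TARGET and merge SOURCE into it, always recording a merge commit.
# Hypothetical helper for illustration only.
merge_no_ff() {
    target=$1
    src=$2
    # Step 1: switch to the branch that should receive the changes.
    git checkout "$target" &&
    # Step 2: merge; --no-ff creates a merge commit even when a
    # fast-forward would be possible, and -m avoids opening an editor.
    git merge --no-ff -m "Merge branch '$src' into $target" "$src"
}
```

With the branches from the example, `merge_no_ff rc dev-harry` would merge dev-harry into rc.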

How to build mplayer with libdvdnav on Linux?

I found that the mplayer package from RPM Fusion does not include libdvdnav. How to build mplayer with libdvdnav on Linux? I did this on Fedora 21 to build mplayer with libdvdnav: Download the mplayer source from http://www.mplayerhq.hu/design7/dload.html and extract the tarball. Install the needed packages (you will need more if you want more features): # yum install gcc make…

How to understand some key system consistency algorithms

When we design a system, we may want it to be consistent, scalable, and so on. There are several well-known consistency algorithms. How can we understand them easily? 1. Paxos and its extensions 2. Replicated state machine mechanisms 3. Quorum You are welcome to add other well-known consistency algorithms and ways to understand them ;-) Reading text books…

When should the authors anonymize themselves in a paper submitted to a conference for review?

When should the authors anonymize themselves in a paper submitted to a conference for review? Several general concepts: Peer review is the evaluation of work by one or more people of similar competence to the producers of the work (peers). — Wikipedia Single-blind describes experiments where information that could introduce bias or otherwise skew the…

How to filter RSS feed items?

How to filter RSS items and keep only the items that I want? Rules may look like: satisfies all or any of the rules { contains keyword; does not contain keyword }, etc. Two services that work well for me: Yahoo Pipes: https://pipes.yahoo.com/pipes/ Pipes is a powerful composition tool to aggregate, manipulate, and mashup…

How to log connections hitting certain rules in iptables on Linux?

How to log connections hitting certain rules in iptables on Linux? For example, the ones that are dropped because SSH connections are created too frequently. You can create a new chain named LOGNDROP that logs the connections and drops them, then redirect the matching connections to the LOGNDROP chain. iptables -N LOGNDROP # Connections…