We enjoy reading.


Here are some good books and ebooks that we hope you will enjoy reading: books.

Posts on the Web

Here is a collection of articles and news on scalable systems. The links are updated via RSS sources. You can subscribe to this page via RSS feed or by email.

  • Posted on Friday November 06, 2020
    Hey, no power outages this week, so it's finally HighScalability time! Stunning: Tycho Crater Region with Colours by Alain Paillou Do you like this sort of Stuff? Without your support on Patreon this Stuff won't happen. Know someone who could benefit from becoming one with the cloud? I wrote Explain the Cloud Like I'm 10 just ... Continue Reading »
  • Posted on Monday November 02, 2020
      This is a guest post by Preetam Jinka, Senior Infrastructure Engineer at ShiftLeft. Originally published here. ShiftLeft NextGen Static Analysis (NG SAST) is a software-as-a-service static analysis solution that allows developers to scan every pull request for security issues. Earlier this year we released Secrets, Security Insights, and a v4 API. Secrets ... Continue Reading »
  • Posted on Tuesday September 22, 2020
      What do you do when you find a snake in your datacenter? You might say this. (NSFW) Fault tolerance is the property that enables a system to continue operating properly in the event of the failure of some of its components. You might think Facebook solved all of its fault tolerance problems ... Continue Reading »
  • Posted on Monday September 21, 2020
      It's not all that different really, especially that part where you can lose all your bitcoin. Here's an excerpt from East of Eden by John Steinbeck: “Say, Carlton, how do you go about telegraphing money?” “Well, you bring me a hundred and two dollars and sixty cents and I send a wire telling ... Continue Reading »
  • Posted on Friday September 18, 2020
    Hey, it's HighScalability time! I can't wait for the duel. Just don't shoot into the air. Do you like this sort of Stuff? Without your support on Patreon this kind of Stuff won't happen. Know someone who could benefit from becoming one with the cloud? Of course you do. I wrote Explain the Cloud Like ... Continue Reading »
  • Posted on Thursday April 09, 2020
    Querying transaction content from a blockchain network is common in scenarios such as exploring the blockchain history or verifying the content of a transaction with a known ID. In Hyperledger Fabric, a transaction can be queried using a special system chaincode, QSCC (Query System Chaincode), which is ... Continue Reading »
  • Posted on Wednesday April 08, 2020
    This post lists some general- and special-purpose blockchains available today, each with a one-sentence description. Readers can quickly understand what each blockchain network is built for and what makes it unique. I also give links to beginner-friendly resources as learning materials. General purpose ... Continue Reading »
  • Posted on Tuesday April 07, 2020
    Hyperledger Fabric is a consortium blockchain system. Its performance is relatively good and its modular architecture makes it usable in many scenarios. Hyperledger Fabric itself has rich documentation and sample test networks. For beginners, deploying a new network for trying and testing still takes quite some time. ... Continue Reading »
  • Posted on Tuesday January 28, 2020
    OneDrive is one of the good cloud storage services available, and there is a business version called OneDrive for Business. Microsoft’s Office 365 plan is widely used and includes the Exchange email service and OneDrive for Business. However, there is no official client released yet for Linux users. Insync is a third ... Continue Reading »
  • Posted on Tuesday November 27, 2018
    Reading: Years in Big Data. Months with Apache Flink. 5 Early Observations With Stream Processing: https://data-artisans.com/blog/early-observations-apache-flink. The article suggests adopting the right solution, Flink, for big data processing. Flink is interesting and built for stream processing. The broader view and takeaway may be to solve problems using the right ... Continue Reading »
  • Posted on Saturday March 24, 2018
    How to deactivate an LVM logical volume activated by #vgchange -aay on Linux: You may need to make an LVM volume group inactive and thus unknown to the kernel. To deactivate a volume group, use the -a (--activate) argument of the vgchange command. To deactivate the volume group vg, use ... Continue Reading »
  • Posted on Saturday March 24, 2018
    On one HDFS cluster, hdfs dfsadmin -report reports: Under replicated blocks: 139016; Blocks with corrupt replicas: 9; Missing blocks: 0. The “Under replicated blocks” can be re-replicated automatically after some time. How to handle the missing blocks and blocks with corrupt replicas in HDFS? Understanding these blocks A block is ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The encoding of x86 and x86-64 instructions is well documented in Intel’s and AMD’s manuals. However, the manuals are not easy for beginners to start with when learning the encoding of x86-64 instructions. In this post, I will give a list of useful manuals for understanding and studying the x86-64 ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The metadata checkpointing in HDFS is done by the Secondary NameNode, which periodically merges the fsimage and the edits log files and keeps the edits log size within a limit. For various reasons, the checkpointing by the Secondary NameNode may fail. For example, the HDFS SecondaryNameNode log shows errors in its ... Continue Reading »
  • Posted on Sunday August 27, 2017
    Introduction In general, if we want to debug the Linux kernel, there are many tools such as Linux Perf, Kprobe, BCC, Ktap, etc., and we can also write kernel modules, proc subsystems or system calls for specific debugging aims. However, if we have to instrument the kernel to achieve our ... Continue Reading »
  • Posted on Saturday August 26, 2017
    Introduction As we know, network subsystems are important in computer systems since they are I/O systems and need to be optimized with many algorithms and skills. This article will introduce how the network part of QEMU/KVM [2] works. To keep everything simple and easy to understand, we will begin with ... Continue Reading »
  • Posted on Sunday August 20, 2017
    Abstract Most popular task monitoring systems (such as top, iotop, proc, etc.) can only get tasks’ disk I/O information, such as tasks’ I/O utilization percentage, every second due to kernel timer/tick frequency and the high time cost of system interfaces. This article presents I/O Microscopy, a new way to get tasks’ disk ... Continue Reading »
  • Posted on Tuesday February 14, 2017
    Motivation Recently, I found it hard to know the percentage of time that one process spends waiting for synchronous I/O (e.g., read). One way is to use the taskstats API provided by the Linux kernel [1]. However, with this approach, precision may be a problem. With this ... Continue Reading »
  • Posted on Sunday November 29, 2015
    Amazon S3 is a widely used public cloud storage system. S3 allows an object/file to be up to 5TB, which is enough for most applications. The AWS Management Console provides a Web-based interface for users to upload and manage files in S3 buckets. However, uploading a large file that is ... Continue Reading »
  • Posted on Tuesday March 10, 2015
    Retail is one of the most important business domains for data science and data mining applications because of its prolific data and numerous optimization problems such as optimal prices, discounts, recommendations, and stock levels that can be solved using data analysis methods. The rise of omni-channel retail that integrates marketing, ... Continue Reading »

  1. Note for blog authors: if you do not want your articles to appear here (we post only an excerpt, not the full content), please drop me a message and I will delete them. If you have good suggestions for blogs/sites (with an RSS feed) to add to this list, please also let me know.

  2. Yeah, the poll() function is broken on macOS and therefore is not supported in Python for the Mac. The select library supports other polling mechanisms; it essentially exposes whatever the OS supports. Let me look into an update to the code that will use kevent on Macs.
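
A minimal sketch of one portable way to sidestep the poll() issue described in the comment above. It assumes Python 3.4 or later and uses only the standard library. Rather than calling kevent directly, it relies on the selectors module, which picks the most efficient mechanism the platform's select module exposes (typically kqueue on macOS and the BSDs, epoll on Linux). The helper names available_mechanisms and wait_for_readable are illustrative and do not come from the original code.

```python
import select
import selectors
import socket


def available_mechanisms():
    """Return the names of the polling mechanisms this platform's select module exposes."""
    names = ("select", "poll", "epoll", "kqueue", "devpoll")
    return [name for name in names if hasattr(select, name)]


def wait_for_readable(sock, timeout=1.0):
    """Return True if sock has data to read within timeout seconds, else False."""
    # DefaultSelector is an alias for the best selector class on this platform,
    # so the same code runs whether or not select.poll is available.
    with selectors.DefaultSelector() as sel:
        sel.register(sock, selectors.EVENT_READ)
        return bool(sel.select(timeout))


if __name__ == "__main__":
    a, b = socket.socketpair()
    try:
        print("available:", available_mechanisms())
        print("readable before send:", wait_for_readable(a, timeout=0.1))
        b.sendall(b"ping")
        print("readable after send:", wait_for_readable(a, timeout=1.0))
    finally:
        a.close()
        b.close()
```

Checking hasattr(select, "poll") at runtime, as available_mechanisms does, avoids hard-coding assumptions about which operating system or Python build supports which call.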
