We enjoy reading.


Here are good books/ebooks that we hope you will enjoy reading: books.

Posts on the Web

Here is a collection of articles and news on scalable systems. The links are updated via RSS sources. You can subscribe to this page via RSS feed or by email.

  • Posted on Monday December 13, 2021
    This is a guest post by Ankit Sirmorya. Ankit is working as a Machine Learning Lead/Sr. Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. Ankit has been working on applying machine learning to solve ambiguous business problems and improve customer experience. For instance, ... Continue Reading »
  • Posted on Tuesday November 30, 2021
      Today I have the pleasure of announcing my new app—Max reHIT Workout—on Product Hunt. Max reHIT Workout is an exercise app that guides you through interval workouts. I won’t pitch the app here. I'll just say I’m proud of how it turned out and if you want an optimal algorithm for exercising, ... Continue Reading »
  • Posted on Saturday October 30, 2021
    Email sender fraud is an annoying problem. Malicious senders send spam with sender addresses under domain names they are not authorized to use. This may damage the reputation of the domains used by the fraudulent sender. On the other hand, lots of emails will be rejected and bounced back, which causes lots of additional ... Continue Reading »
  • Posted on Monday October 11, 2021
    What would a totally new search engine architecture look like? Who better than Julien Lemoine, Co-founder & CTO of Algolia, to describe what the future of search will look like. This is the second article in a series. Here's Part 1. Search engines need to support fast scaling for both Read ... Continue Reading »
  • Posted on Monday October 04, 2021
    Hey, HighScalability is here again! The circulatory system of the internet. @tylermorganwall Love this Stuff? I need your support on Patreon to keep this stuff going. Sorry for the long gap in posting, but I’ve been building a new app. I’m looking for testers for my new iOS fitness app: Max reHIT Workout. ... Continue Reading »
  • Posted on Monday August 02, 2021
    What would a totally new search engine architecture look like? Who better than Julien Lemoine, Co-founder & CTO of Algolia, to describe what the future of search will look like. This is the first article in a series. Search engines, and more generally, information retrieval systems, play a central role ... Continue Reading »
  • Posted on Thursday April 09, 2020
    Querying transaction content from a blockchain network is common in scenarios such as exploring the blockchain history or verifying the content of a transaction with a known ID. In Hyperledger Fabric, a transaction can be queried using a special system chaincode, QSCC (Query System Chaincode), which is ... Continue Reading »
  • Posted on Wednesday April 08, 2020
    This post lists some general- and special-purpose blockchains available nowadays, each with a one-sentence description. Readers can quickly understand what each blockchain network is built for and what makes it unique. I also give links to beginner-friendly resources as learning materials. General purpose blockchains ... Continue Reading »
  • Posted on Tuesday April 07, 2020
    Hyperledger Fabric is a consortium blockchain system. Its performance is relatively good and its modular architecture makes it usable in many scenarios. Hyperledger Fabric itself has rich documentation and samples of test networks. For beginners, deploying a new network for trying and testing still consumes quite some time. ... Continue Reading »
  • Posted on Tuesday January 28, 2020
    OneDrive is one of the good cloud storage services available, and there is a business version called OneDrive for Business. Microsoft’s Office 365 plans, which include the Exchange email service and OneDrive for Business, are widely used. However, there is no official client released yet for Linux users. Insync is a third ... Continue Reading »
  • Posted on Tuesday November 27, 2018
    Reading: Years in Big Data. Months with Apache Flink. 5 Early Observations With Stream Processing: https://data-artisans.com/blog/early-observations-apache-flink. The article suggests adopting the right solution, Flink, for big data processing. Flink is interesting and built for stream processing. The broader view and takeaway may be to solve problems using the right ... Continue Reading »
  • Posted on Saturday March 24, 2018
    How to deactivate an LVM logical volume activated by #vgchange -aay on Linux. You may need to make an LVM volume group inactive and thus unknown to the kernel. To deactivate a volume group, use the -a (--activate) argument of the vgchange command. To deactivate the volume group vg, use ... Continue Reading »
  • Posted on Saturday March 24, 2018
    On one HDFS cluster, hdfs dfsadmin -report reports: Under replicated blocks: 139016; Blocks with corrupt replicas: 9; Missing blocks: 0. The “Under replicated blocks” can be re-replicated automatically after some time. How to handle the missing blocks and blocks with corrupt replicas in HDFS? Understanding these blocks: A block is ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The encoding of x86 and x86-64 instructions is well documented in Intel’s and AMD’s manuals. However, the manuals are not easy for beginners who want to learn the encoding of x86-64 instructions. In this post, I will give a list of useful manuals for understanding and studying the x86-64 ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The metadata checkpointing in HDFS is done by the Secondary NameNode, which periodically merges the fsimage and the edits log files and keeps the edits log size within a limit. For various reasons, the checkpointing by the Secondary NameNode may fail. For example, the HDFS SecondaryNameNode log shows errors in its ... Continue Reading »
  • Posted on Sunday August 27, 2017
    Introduction: In general, if we want to debug the Linux kernel, there are lots of tools such as Linux Perf, Kprobe, BCC, Ktap, etc., and we can also write kernel modules, proc subsystems or system calls for some specific debugging aims. However, if we have to instrument the kernel to achieve our ... Continue Reading »
  • Posted on Saturday August 26, 2017
    Introduction: As we know, network subsystems are important in computer systems since they are I/O systems and need to be optimized with many algorithms and skills. This article will introduce how the network part of QEMU/KVM [2] works. To keep everything simple and easy to understand, we will begin with ... Continue Reading »
  • Posted on Sunday August 20, 2017
    Abstract: Most popular task monitoring systems (such as top, iotop, proc, etc.) can only get tasks’ disk I/O information, such as tasks’ I/O utilization percentage, every second due to kernel timer/tick frequency and the high time cost of system interfaces. This article presents I/O Microscopy, a new way to get tasks’ disk ... Continue Reading »
  • Posted on Tuesday February 14, 2017
    Motivation: Recently, I found it hard to know the percentage of time that a process spends waiting for synchronous I/O (e.g., read). One way is to use the taskstats API provided by the Linux kernel [1]. However, with this approach, precision may be a problem. With this ... Continue Reading »
  • Posted on Sunday November 29, 2015
    Amazon S3 is a widely used public cloud storage system. S3 allows an object/file to be up to 5 TB, which is enough for most applications. The AWS Management Console provides a Web-based interface for users to upload and manage files in S3 buckets. However, uploading a large file that is ... Continue Reading »

  1. Note for blog authors: if you do not want your articles to appear here (we only post an excerpt, not the full content), please drop me a message and I will delete them. If you have good suggestions for blogs/sites (with an RSS feed) to add to this list, please also let me know.

  2. Yeah, the poll() function is broken on macOS and therefore is not supported in Python for the Mac. The select library supports other polling mechanisms; it essentially exposes whatever the OS supports. Let me look into an update to the code that will use kevent on Macs.
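
As a rough illustration of the point about the select library exposing whatever the OS supports, here is a minimal sketch in Python. It is not the updated code mentioned above. The helper names available_mechanisms and wait_readable are made up for this example. It relies on the standard selectors module, whose DefaultSelector picks the most efficient mechanism available on the platform (kqueue on macOS where it is available), rather than calling kevent directly.

```python
import select
import selectors
import socket


def available_mechanisms():
    """List the polling mechanisms the select module exposes on this OS.

    Hypothetical helper for illustration: it only checks which of the
    well-known names exist as attributes of the select module.
    """
    names = ("epoll", "kqueue", "poll", "select")
    return [name for name in names if hasattr(select, name)]


def wait_readable(sock, timeout=1.0):
    """Return True if sock becomes readable within timeout seconds.

    DefaultSelector is an alias for the best selector implementation
    available on the current platform, so no call to poll() is hard-coded.
    """
    with selectors.DefaultSelector() as sel:
        sel.register(sock, selectors.EVENT_READ)
        return bool(sel.select(timeout))


if __name__ == "__main__":
    a, b = socket.socketpair()
    print("mechanisms exposed here:", available_mechanisms())
    a.send(b"ping")
    print("readable:", wait_readable(b))
    a.close()
    b.close()
```

Going through selectors keeps the calling code identical on Linux and macOS; code that needs kqueue-specific features would have to use select.kqueue and select.kevent directly instead.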