Readings on Systems

We enjoy readings.

Books

Here are good books/ebooks that we hope you will find enjoy reading: books.

Posts on the Web

Here is a collection of articles and news on scalable systems. The links are updated via RSS sources. You can subscribe to this page via RSS feed or by email.

  • Posted on Monday August 02, 2021
      What would a totally new search engine architecture look like? Who better than Julien Lemoine, Co-founder & CTO of Algolia, to describe what the future of search will look like. This is the first article in a series. Search engines, and more generally, information retrieval systems, play a central role ... Continue Reading »
  • Posted on Friday June 25, 2021
    Hey, it's HighScalability time!  Only listen if you want a quantum earworm for the rest of the day.   Not your style? This is completely different. No, it’s even more different than that. Today in things that nobody stopped me from doing:The AWS Elastic Load Balancer Yodel Rag. pic.twitter.com/ocyVLf8WlU — Forrest Brazeal (@forrestbrazeal) May 28, ... Continue Reading »
  • Posted on Tuesday June 08, 2021
    This post introduces common methods to prevent Linux memory fragmentation, the principle of memory compaction, how to view the fragmentation index, etc. Continue Reading »
  • Posted on Friday April 30, 2021
    Hey, HighScalability is back! This channel is the perfect blend of programming, hardware, engineering, and crazy. After watching you’ll feel inadequate, but in an entertained sort of way.   Love this Stuff? I need your support on Patreon to keep this stuff going.   Do employees at your company need to know about the cloud? ... Continue Reading »
  • Posted on Thursday April 09, 2020
    Querying transaction content out from a blockchain network is a common practice used by common scenarios like exploring the blockchain history or verifying the blockchain transaction content from a known ID. In Hyperledger Fabric, the transaction can be queried using a special system chaincode QSCC (Query System Chaincode) which is ... Continue Reading »
  • Posted on Wednesday April 08, 2020
    This post lists some general- and special-purpose blockchains nowadays available with a short description in one sentence. Readers can have a quick understanding of what a blockchain network is built for and what uniqueness that blockchain has. I also give links to beginner friendly resources as learn materials. General purpose ... Continue Reading »
  • Posted on Tuesday April 07, 2020
    Hyperledger Fabric is a consortium blockchain system. It’s performance is relatively good and its modular architecture enables it to be usable in many scenarios. Hyperledger Fabric itself has rich documents and samples of test networks. For beginners, deploying a new network for trying and testing still consumes quite some time. ... Continue Reading »
  • Posted on Tuesday January 28, 2020
    OneDrive is one of the good cloud storage services available and there is a business version called OneDrive for Business. Microsoft’s Office 365 plan is widely used including Exchange Email service and OneDrive for Business. However, there is no official client released yet for Linux users. Insync is a third ... Continue Reading »
  • Posted on Tuesday November 27, 2018
    Reading: Years in Big Data. Months with Apache Flink. 5 Early Observations With Stream Processing: https://data-artisans.com/blog/early-observations-apache-flink. The article suggest adopting the right solution, Flink, for big data processing. Flink is interesting and built for stream processing. The broader view and take away may be to solve problems using the right ... Continue Reading »
  • Posted on Saturday March 24, 2018
    How to deactivate a LVM logical volume activated by #vgchange -aay on Linux You may need to make a LVM volume group inactive and thus unknown to the kernel. To deactivate a volume group, use the -a (--activate) argument of the vgchange command. To deactivates the volume group vg, use ... Continue Reading »
  • Posted on Saturday March 24, 2018
    One of HDFS cluster’s hdfs dfsadmin -report reports: Under replicated blocks: 139016 Blocks with corrupt replicas: 9 Missing blocks: 0 The “Under replicated blocks” can be re-replicated automatically after some time. How to handle the missing blocks and blocks with corrupt replicas in HDFS? Understanding these blocks A block is ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The encoding of x86 and x86-64 instructions is well documented in Intel or AMD’s manuals. However, they are not quite easy for beginners to start with to learn encoding of the x86-64 instructions. In this post, I will give a list of useful manuals for understanding and studying the x86-64 ... Continue Reading »
  • Posted on Saturday September 09, 2017
    The metadata checkpointing in HDFS is done by the Secondary NameNode to merge the fsimage and the edits log files periodically and keep edits log size within a limit. For various reasons, the checkpointing by the Secondary NameNode may fail. For one example, HDFS SecondaraNameNode log shows errors in its ... Continue Reading »
  • Posted on Sunday August 27, 2017
    Introduction In general, if we want to debug Linux Kernel, there are lots of tools such as Linux Perf, Kprobe, BCC, Ktap, etc, and we can also write kernel modules, proc subsystems or system calls for some specific debugging aims. However, if we have to instrument kernel to achieve our ... Continue Reading »
  • Posted on Saturday August 26, 2017
    Introduction As we know, network subsystems are important in computer systems since they are I/O systems and need to be optimized with many algorithms and skills. This article will introduce how QEMU/KVM [2] network part works. In order to put everything simple and easy to understand, we will begin with ... Continue Reading »
  • Posted on Sunday August 20, 2017
    Abstract Most popular task monitor systems (such as top, iotop, proc, etc) can only get tasks’ disk I/O information like tasks’ I/O utilization percentage every seconds due to kernel timer/tick frequency and high time cost of system interfaces. This article presents I/O Microscopy, a new way to get tasks’ disk ... Continue Reading »
  • Posted on Tuesday February 14, 2017
    Motivation Recently, I find it is hard to know the percentage of time that one process uses to wait for synchronous I/O (eg, read, etc). One way is to use the taskstats API provided by Linux Kernel [1]. However, for this way, the precision may be one problem. With this ... Continue Reading »
  • Posted on Sunday November 29, 2015
    Amazon S3 is a widely used public cloud storage system. S3 allows an object/file to be up to 5TB which is enough for most applications. The AWS Management Console provides a Web-based interface for users to upload and manage files in S3 buckets. However, uploading a large files that is ... Continue Reading »
  • Posted on Tuesday March 10, 2015
    Retail is one of the most important business domains for data science and data mining applications because of its prolific data and numerous optimization problems such as optimal prices, discounts, recommendations, and stock levels that can be solved using data analysis methods. The rise of omni-channel retail that integrates marketing, ... Continue Reading »
  • Posted on Sunday September 14, 2014
    Hadoop 2 or YARN is the new version of Hadoop. It adds the yarn resource manager in addition to the HDFS and MapReduce components. Hadoop MapReduce is a programming model and software framework for writing applications, which is an open-source variant of MapReduce designed and implemented by Google initially for ... Continue Reading »

2 comments

  1. Note for blog authors: if you do not want your articles appear here (we just post a excerpt, not the full content), please drop me a message and I will delete them. If you have good suggestions on blogs/sites (with a RSS feed) to add to this list, please also let me know.

  2. Yeah, the poll() function is broken on MacOS and therefore is not supported in Python for the Mac.The select library supports other polling mechanisms; it essentially exposes whatever the OS supports. Let me look into an update to the code that will use kevent on Macs.

Leave a Reply

Your email address will not be published. Required fields are marked *