Apache

Programming Languages

Troubleshooting “invalid syntax, continuing” errors in sysctl
ByEric Ma Mar 24, 2018Apr 12, 2026

The “invalid syntax, continuing…” warning from sysctl typically means your /etc/sysctl.conf file has a malformed line that the parser can’t understand. Here’s how to diagnose and fix it. Identifying the Problem When you run sysctl -p to load configuration from /etc/sysctl.conf, any syntax error will be printed with a line number: sysctl -p warning: /etc/sysctl.conf(44):…

Read More Troubleshooting “invalid syntax, continuing” errors in sysctl
Design Patterns & Architecture

Configuring Hadoop’s Job Scheduling Policy
ByEric Ma Mar 24, 2018Apr 13, 2026

The YARN resource scheduler determines how cluster resources are allocated to jobs. By default, Hadoop uses the Capacity Scheduler, but you can switch to an alternative like the Fair Scheduler or configure different scheduling policies. Identifying Your Current Scheduler Check which scheduler is currently active by examining your configuration: grep -A 2 “yarn.resourcemanager.scheduler.class” $HADOOP_HOME/etc/hadoop/yarn-site.xml The…

Read More Configuring Hadoop’s Job Scheduling Policy
Linux & Systems Administration

Hosting Multiple Websites on Apache with Virtual Hosts
ByEric Ma Mar 24, 2018Apr 13, 2026

Virtual hosts allow you to serve multiple websites from a single Apache server and IP address. This is the standard approach for shared hosting and cost-effective server deployments. Basic Virtual Host Setup The foundation of multi-site hosting on Apache is the VirtualHost directive. Each domain needs its own block that specifies where its files live…

Read More Hosting Multiple Websites on Apache with Virtual Hosts
Linux & Systems Administration

Enabling .htaccess in Apache2
ByEric Ma Mar 24, 2018Apr 12, 2026

By default, Apache2 ignores .htaccess files. The AllowOverride directive controls which directives in .htaccess Apache will process. To enable .htaccess, you need to configure Apache to allow overrides. Enable AllowOverride Edit your Apache configuration file. On Ubuntu/Debian, this is typically: sudo nano /etc/apache2/apache2.conf Find the <Directory /var/www/> block and change AllowOverride None to AllowOverride All:…

Read More Enabling .htaccess in Apache2
Programming Languages

Using Google Custom Search for WordPress Queries
ByEric Ma Mar 24, 2018Apr 13, 2026

Google Custom Search Engine (now part of Programmable Search Engine) lets you offload search functionality to Google’s infrastructure. Redirecting WordPress search queries to CSE can improve search quality, especially for large sites with poor built-in search performance. How it works WordPress uses the s query parameter by default when processing searches from the search form….

Read More Using Google Custom Search for WordPress Queries
Systems & Architecture

Free SSL/TLS Certificates: Let’s Encrypt and Alternatives
ByEric Ma Mar 24, 2018Apr 13, 2026

Securing your website with HTTPS is no longer optional—it’s essential for SEO, user trust, and browser compatibility. The good news: Let’s Encrypt provides free certificates, and the tooling has matured significantly. Using Certbot with Nginx or Apache The standard approach is Certbot, which automates certificate issuance and renewal: sudo certbot –nginx This automatically configures HTTPS…

Read More Free SSL/TLS Certificates: Let’s Encrypt and Alternatives
Programming Languages

Force HTTPS Redirects with .htaccess
ByEric Ma Mar 24, 2018Apr 13, 2026

Forcing HTTPS on your website protects visitor data and improves SEO. The most common method is using .htaccess with Apache’s mod_rewrite module to redirect all HTTP requests to HTTPS. Basic HTTPS redirect Add this to your .htaccess file in the document root: RewriteEngine On RewriteCond %{HTTPS} !=on RewriteRule ^ https://%{HTTP_HOST}%{REQUEST_URI} [L,R=301] This performs a permanent…

Read More Force HTTPS Redirects with .htaccess
Programming Languages

Caching Mapper Output in Hadoop: Strategies for Reusing Intermediate Results
ByEric Ma Mar 24, 2018Apr 13, 2026

The core problem here is legitimate: if you’re running multiple jobs on the same dataset where the mapper phase produces identical intermediate results, recomputing those results is wasteful. However, skipping the mapper phase entirely breaks MapReduce’s processing model. There are better approaches. Why You Can’t Just Skip the Mapper MapReduce assumes data flows through map…

Read More Caching Mapper Output in Hadoop: Strategies for Reusing Intermediate Results
Programming Languages

Configuring HDFS Replication Factors by Directory
ByEric Ma Mar 24, 2018Apr 12, 2026

HDFS doesn’t natively support directory-level replication factor inheritance. Even if you set a specific replication factor on a directory and its files, new files created in that directory will default to the cluster’s global dfs.replication setting (typically 3). This limitation can complicate multi-tier storage strategies where you want temporary or low-priority data on fewer replicas…

Read More Configuring HDFS Replication Factors by Directory
Linux & Systems Administration

Listing Running Services on Linux with systemctl
ByEric Ma Mar 24, 2018Apr 13, 2026

Listing active services is essential for system monitoring and troubleshooting. Whether you’re investigating a hung process, checking if a daemon started correctly, or auditing what’s actually running on your system, knowing how to query service status quickly saves time. Basic Command The standard way to list running services across all modern Linux distributions is: systemctl…

Read More Listing Running Services on Linux with systemctl
Development Best Practices

Spark SQL: DDL and DML Operations Explained
ByEric Ma Mar 24, 2018Apr 12, 2026

Spark SQL doesn’t have a separate DDL/DML specification distinct from Hive QL — it inherits its SQL dialect directly from Hive. If you’re designing a SQL engine or looking to understand Spark SQL’s data definition and manipulation capabilities, you need to reference Hive’s DDL and DML documentation. Why Spark SQL Uses Hive QL Spark SQL…

Read More Spark SQL: DDL and DML Operations Explained
Linux & Systems Administration

Extract Linux Logs Within a Specific Time Range
ByEric Ma Mar 24, 2018Apr 13, 2026

When processing application logs (like Hadoop/log4j output), you often need to extract entries from a specific time window. This is especially common in automated routines that run periodically—for example, pulling the last 4 hours of logs every 4 hours via cron. Log format considerations Most application logs follow a consistent timestamp format. For example, log4j…

Read More Extract Linux Logs Within a Specific Time Range
Linux & Systems Administration

Redirect HTTP to HTTPS Using Apache mod_rewrite
ByEric Ma Mar 24, 2018Apr 13, 2026

If you want to force HTTPS on your site, add this to your .htaccess file: RewriteEngine On RewriteCond %{HTTPS} off RewriteRule ^(.*)$ https://%{HTTP_HOST}%{REQUEST_URI} [L,R=301] The key parts: RewriteCond %{HTTPS} off — matches only HTTP requests ^(.*)$ — matches any request https://%{HTTP_HOST}%{REQUEST_URI} — rebuilds the URL with HTTPS [L,R=301] — uses a permanent redirect (301) and…

Read More Redirect HTTP to HTTPS Using Apache mod_rewrite
Databases & Storage

Clear Cached Redirects in Chrome: A Developer’s Guide
ByEric Ma Mar 24, 2018Apr 12, 2026

Chrome aggressively caches HTTP redirects—especially 301 (permanent) and 308 redirects—as a performance optimization. The cache persists across browser sessions, which causes real problems when you’ve updated redirect rules and need to test them immediately. Understanding Chrome’s Redirect Cache Behavior Chrome stores redirect information at the browser level, separate from typical page caches. This means: 301…

Read More Clear Cached Redirects in Chrome: A Developer’s Guide
Design Patterns & Architecture

Understanding Hadoop Configuration Files: Locations and Defaults
ByEric Ma Mar 24, 2018Apr 13, 2026

Hadoop uses three primary configuration files to define YARN, HDFS, and MapReduce behavior: HDFS: hdfs-site.xml YARN: yarn-site.xml MapReduce: mapred-site.xml These files live in $HADOOP_HOME/etc/hadoop/ and override the built-in defaults when present. Finding Official Default Values Apache publishes default configuration documentation for each release. For current versions: Hadoop 3.4.x (Latest) HDFS defaults: https://hadoop.apache.org/docs/r3.4.0/hadoop-project-dist/hadoop-hdfs/hdfs-default.xml YARN defaults: https://hadoop.apache.org/docs/r3.4.0/hadoop-yarn/hadoop-yarn-common/yarn-default.xml…

Read More Understanding Hadoop Configuration Files: Locations and Defaults
Development Best Practices

Understanding YARN: Resource Management and Cluster Fundamentals
ByEric Ma Mar 24, 2018Apr 12, 2026

YARN (Yet Another Resource Negotiator) fundamentally restructured Hadoop 2.0 by decoupling resource management from application logic. If you’re transitioning from Hadoop 1.x or building systems on top of YARN, understanding its architecture is essential for effective cluster administration and application development. Essential Reading The foundational paper Start with “Apache Hadoop YARN: Yet Another Resource Negotiator”…

Read More Understanding YARN: Resource Management and Cluster Fundamentals
System Administration & Cloud

Custom Headers and Footers in Apache Directory Listings
ByEric Ma Mar 24, 2018Apr 12, 2026

Custom headers and footers in Apache directory listings let you add branding, instructions, or navigation without modifying the core file listing. This is useful for public file archives, download directories, or any indexed directory you want to customize. Basic Setup with .htaccess Add these directives to .htaccess in the directory where you want custom listings:…

Read More Custom Headers and Footers in Apache Directory Listings
Design Patterns & Architecture

Configuring Hadoop Classpath for MapReduce Compilation
ByEric Ma Mar 24, 2018Apr 13, 2026

When compiling MapReduce jobs against a Hadoop installation, you need to include the correct classpath to resolve Hadoop dependencies. The yarn classpath command handles this automatically. Getting the classpath Run this command to output the full classpath: yarn classpath If yarn isn’t in your $PATH, use the full path: $HADOOP_HOME/bin/yarn classpath Replace $HADOOP_HOME with your…

Read More Configuring Hadoop Classpath for MapReduce Compilation
Linux & Systems Administration

WordPress Multisite: Running from a Subdirectory
ByQ A Mar 24, 2018Apr 12, 2026

Installing WordPress in a subdirectory keeps your site root cleaner and makes it easier to manage core files separately from content. Here’s how to configure WordPress so the admin and core files live in /wordpress while the site appears at your domain root. Creating the Subdirectory Structure First, create the WordPress directory and set proper…

Read More WordPress Multisite: Running from a Subdirectory
Linux & Systems Administration

Configure PHP’s Maximum Execution Time and Memory Limits
ByQ A Mar 24, 2018Apr 13, 2026

You’re seeing fatal errors because PHP has hit its default resource constraints: PHP Fatal error: Maximum execution time of 30 seconds exceeded PHP Fatal error: Allowed memory size of 268435456 bytes exhausted These limits exist to prevent runaway scripts from consuming server resources. Here’s how to increase them. Via php.ini (Global Configuration) The most reliable…

Read More Configure PHP’s Maximum Execution Time and Memory Limits