Writing

Recovering MongoDB data when WiredTiger.wt is corrupted

How to recover MongoDB data when the WiredTiger.wt file is corrupted.

Feature engineering for time series forecasting

A simple guide to feature engineering techniques for time series forecasting using XGBoost and Polars.

More than 40% of job postings on LinkedIn are fake

A quick analysis of LinkedIn job postings to estimate the percentage that are fake.

SQL vs dplyr vs pandas

A comparison of SQL, dplyr, and pandas syntax for data manipulation.

Shared remote development environment for a research lab

My account of setting up a computational environment for our research group of 10 members, most of whom are PhD students and postdocs. The goal was to give everyone easy access to a powerful computational environment with a simple setup so that they could focus on their research.

Methods to get unique values from arrays in JavaScript and their performance

A quick overview of methods to get unique values from arrays in JavaScript, including performance comparisons.