Vincent Jiang | How do you analyze data directly from MySQL, PostgreSQL, and MongoDB | If you’re using MySQL Workbench, pgAdmin, or another visual database design tool, you probably wonder why it is so hard to use them for… | Mar 12, 2021
Vincent Jiang | Difference between ETL and ELT | Traditionally, companies have used ETL pipelines to connect production systems with data warehouses. This, however, has changed a lot in recent… | Dec 26, 2020
Vincent Jiang | How to split large .CSV file in batches | If you are dealing with a lot of large .CSV files and you want to split them into small files, there are typically two things that you can… | Dec 20, 2020
Vincent Jiang | How to do data cleansing? | Data cleansing (cleaning) is not a very structured practice, since data comes in all different sizes and formats. In general, these are… | Dec 9, 2020
Vincent Jiang | Difference between data analyst, data engineer and data scientist | Ever wonder why you’re expected to deliver before accepting your new job as a data analyst? There are actually significant differences… | Dec 9, 2020
Vincent Jiang | How remote working (work from home) can be efficient too | Like it or not, remote working can be exhausting. Many teams around the world are coping with this “new norm” in one way or another. Our… | Dec 9, 2020
Vincent Jiang | The difference between spreadsheet and database | This feels like a century-old question: “can I use Google Sheets or Excel as a database?” The short answer is unfortunately no. The efforts… | Dec 4, 2020
Vincent Jiang | How to access big .CSV file? | Bigger .CSV files are not very accessible in most desktop programs. In fact, there is a hard limit of 1,048,576 rows and 16,384 columns in… | Nov 6, 2020
Vincent Jiang | Why does Excel crash all the time? | Maybe you’ve experienced this in the past: Excel is sometimes not the most stable application on your computer. Though the reasons behind… | Nov 6, 2020