In some cases, breaking your larger programming problems into smaller, parallelizable units makes sense from a time complexity problem. If the work you are trying to perform exhibits some of these parallelizable characteristics, you should only need to wait for the longest of your jobs to finish.
In today’s post, we’ll be talking about GNU Parallel.
A summary from their website:
GNU parallel is a shell tool for executing jobs in parallel using one or more computers. A job can be a single command or a small script that has to be run for each of the lines in the input. The typical input is a list of files, a list of hosts, a list of users, a list of URLs, or a list of tables. A job can also be a command that reads from a pipe. GNU parallel can then split the input and pipe it into commands in parallel.
Input
The input system is quite complex. Delimiting the inputs with the ::: operator, parallel will make a catersian product out of the input values.
parallel echo ::: John Paul Sally
John
Paul
Sally
parallel echo ::: John Paul Sally ::: Smith
John Smith
Paul Smith
Sally Smith
parallel echo ::: John Paul Sally ::: Smith Jones Brown
John Smith
John Jones
John Brown
Paul Smith
Paul Jones
Paul Brown
Sally Smith
Sally Jones
Sally Brown
Linkage is possible using :::+, should this flexibility be required.
parallel echo ::: John Paul Sally :::+ Smith Jones Brown
John Smith
Paul Jones
Sally Brown
In today’s article, we’ll talk a bit about the HDF5 format. From the HDF5 website:
An HDF5 file is a container for two kinds of objects: datasets, which are array-like collections of data, and groups, which are folder-like containers that hold datasets and other groups.
GNU Grep is an amazingly powerful tool that will allow you to search input files containing specified patterns.
The following command performs a recursive search for a string, inside a directory of files:
grep-rnw'/path/to/search'-e'pattern-to-find'
Breaking this command down:
-r makes the execution recursive
-n will give you the line number
-w matches the whole word
You can use the --exclude switch to remove file patterns from the set of files included in your search. Removing js and html files from your search might look like this:
--exclude*.{js,html}
The opposite of this will be the --include switch.
For further details on this really useful tool, check out the man page.
Sometimes you might find yourself in the situation where you require a little more power out of your node.js application. You may need to squeeze some extra performance out of a piece of code that you simply can’t achieve using javascript alone. Node.js provides a very rich sdk to allow application developers to create their own addons to use, that allow you to write in C++.
These binary compiled modules then become directly accessible from your node.js applications.
In today’s article, I’d like to walk through the basic setup of an addon project. We’ll also add a function to the addon, and demonstrate the call from javascript to C++.
Setup
Before you can get developing, you’ll need to make sure you have some dependencies installed. Create a directory, and start a new node application.
mkdir my-addon
cd my-addon
npm init
You’ll need to let the package manager know that your application has a gyp file present by switching gypfile to true.
The project is going to require a gyp file called binding.gyp. It’s the responsibility of this file to generate the build environment that will compile our addon.
The keen reader would see that our module does nothing. That’s ok to start with. This will be an exercise in checking that the build environment is setup correctly.
Import and use your addon just like you would any other module from within the node environment.
The getGreeting function is actually doing the work here. It’s simply returning a greeting. The InitAll function now changes to add a Set call on the exports object. This is just registering the function to be available to us.
Greetings
So, now we can actually use the greeting. We can just console.log it out.