Talend exercises in Golang: ‘Advanced’ XML

This is a port of the previous post to golang. Golang is the closest I’ve gotten to writing production-code in a statically typed C-like language. Its nice, though I still can’t get over reading The Go Programming Language and seeing them refer to C and Go as high-level languages. Having started out in Lisp, it […]

Talend exercises in Clojure: ‘Advanced’ XML

This time, we convert a CSV-file to an XML file, but place the order-attributes not as content tags in the XML, but as attributes of each ‘ORDER_LINE’ tag. For an unknown reason, this part is titled ‘Advanced XML output for complex XML structures’. It took 54 lines of code in Clojure. Feast your eyes on […]

Talend exercises in Clojure

In the last few weeks I’ve gone through two more exercises from the Talend book, and its still very basic right now. It was slow going at first, since I’ve not used the CSV and XML tools in Clojure before now. But the last exercise went by a whole lot faster now that I know […]

Talend to Clojure

I’ve had the displeasure of being introduced to the Talend data tool a few months ago. I hate it. Its source-code should be printed out, then burned, and the ashes should be flushed into a cess-pit of nuclear waste. The Talend people peg it as an easy to use tool to get data massaged into […]

Moving on to RubyMine

A few months ago at my last job, I was introduced to a senior programmer who had just started using Ruby. Since this was his first gig in Ruby, and he used to work with .Net projects before this, he wasn’t comfortable with the Ruby command-line way of programming (Edit program in text editor, then […]

Machine learning with Apache Spark

Installing Apache Spark on Ubuntu Download from the project page. Unzip into a directory. For example, I put all my custom stuff in `~/bin`. Add the bin directory of the exploded tar-file to your PATH variable. Running `which spark-shell` should point to the correct path. Restart your shell if it doesn’t. Books Fast Data Processing […]

Getting back to machine learning

In the last few months I’ve gotten back to working on AI. I was talking to a co-worker of mine about the pleasance project, and doing that got me bit by the bug again. I’ve had my new multicore desktop computer for a while now, and I should use it for parallel processing problems. Pleasance-jruby […]

Automatically start rails server with upstart

The following is how to set up an Upstart file so you can have a rails application autostart upon the startup of a server. Modify the “USERNAME” and “TAG_FOR_THIS_PROCESS_IN_SYSLOG” fields to something that you want. It will run the server as the user you declare, so that the server isn’t run as the root user […]

Docker and Torquebox

My OVH box went down a few weeks ago and refused to get back up. I couldn’t even ssh in. No clue why. On the bright side, I got to use this as an opportunity to learn more about keeping things fail-safe. I’d say “more fail-safe”, but apart from github for my source-code, nothing was […]

Update to JParallel

I began the JParallel library a little while ago (Right after the post about not doing my own parallel programming (It doesn’t make a lot of sense)), and nearly immediately got a response from Mohamed Hafez on the issue tracker recommending I add a bunch of features. And a week later he implemented things in […]