I used DeepLearning4j to train word2vec model. Then I had to save the dictionary to CSV so I can run some clustering algorithms on it. Sounded like a simple task, but it took a while, and here is the code to do…
Xanda BI Toolkit: clustering
In the previous post we introduced the toolkit release to open source and the general idea behind the project, now I would like to share clustering implementation. At this point we implemented 3 clustering algorithms: K-means Very straight-forward algorithm K-means is memory-friendly and…
Xandra BI Toolkit powered by ML released to Open Source
We are happy to announce that will be partially releasing our Python Business Intelligence Toolkit powered by machine learning algorithms to open-source. Idea The idea behind the Xandra BI toolkit is to provide an easy way for companies to arrange, process,…
5 programming languages to fall in love with on St. Valentine’s Day.
Saint Valentine’s Day is a holiday of love not only toward your beloved one or family, but also to things like… programming languages. We would like to outline 5 programming languages to fall in love with on St. Valentine’s Day. Python…
How to parse dynamic HTML content using Python
In the previous tutorial we learning how to parse HTML in Python. In the Python tutorial we are going to learn to to parse dynamic HTML content generated by JavaScript, jQuery, Ajax, Angular or other dynamic pages technology. What’s the problem with…
How to parse emails from HTML in Python
In this tutorial we are going to get an idea of how to parse emails from HTML using Python. Python is a scripting language easy to get started and is perfect for tasks like parsing emails. So let’s elaborate an approach…
Update XML node in Python
I like python because it’s minimalistic and elegant.Let’s see how to update an XML node using ElementTree. We use CD catalog in XML as a datasource. Here is the python script itself. And the output is the following:
Remove duplicate lines from a file in Scala
How to remove duplicate lines from csv or txt file? The answer is quite straightforward: you basically need BufferedReader and BufferedWriter, and this also works for large files quite well.
Top 5 useful Java Libs
Java is an advanced language, but nonetheless there are libs to make life even more easier. We would like to share 5 useful libs to help you with projects of different kind. FileUtils – Apache Commons Small but a very useful lib…
Python networking example
Here is a small example demonstrating get requests in Python. And the code itself