Xomnia is a word combining the letter ‘X’ – the unknown – and “Omnia” – Latin for everything. Our team of data scientists and big data engineers are trained to find the undefined – X – in all the relevant data sources – Omnia. This unknown – X – is untapped business value. Combining the X and Omnia you get the Xomnia spirit. Eager, curious and dedicated people, who have the belief that the future is big data.
For the Municipality of Utrecht, searching through a digital archive containing 10,000 scans of PDF documents from over 500 real estate properties including drawings is a technical challenge, and can be quite a headache. To search through these documents, the municipality used an application with an inefficient search system that didn’t allow for targeted searches, and employees spent hours waiting for the system to search. An alternative was needed.
Municipality Utrecht’s Property Management department is responsible for maintaining real estate. The activities primarily relate to corrective maintenance, and preventive & planned maintenance. Depending on the nature of the work, the activities are carried out on a project basis.
Xomnia designed and implemented a smart search engine comparable to Google. The ranking of the documents makes use of a smart algorithm that puts the document that is most likely the correct result at the top of the list, and pops open results in a new tab.
We built a backscan pdf search engine for the municipality Utrecht. The web server Nginx was used, and the code is written in Python in the Flask framework. First the pdfs are parsed to make sure everything in the PDF is searchable. Wand is used to make the images searchable, and Tika is used for the metadata. Elasticsearch is then used to search the PDFs for the given query.
The end result has saved employees a lot of time. This means they have also saved on costs. Previously, searching for a document could take up to an hour, and now it takes just minutes.