Infomaniak is developing its AI to improve corporate productivity

Published on August 27, 2021 by Team Infomaniak

Infomaniak is preparing for the arrival of artificial intelligence (AI) in its productivity tools in order to improve corporate productivity. Privacy, confidentiality and data management are at the heart of our development, as Infomaniak makes no concessions on these points. Available as an option, the AI developed by our teams will respect the strict framework of the GDPR and Swiss data protection legislation.

Work more quickly with kDrive and webmail

Our AI will be capable of finding and categorising your files and attachments so that you can quickly access your:

bills
receipts
sales receipts
CV
delivery slips
expense reports
estimates
payslips
contracts
identity documents
etc.

Each document will be automatically labelled. Simply click on a label to display the desired content category. For example, you can display bills from a single supplier, sales slips from a particular month or payslips for the past year. Eventually, it will even be possible to create your own labels.

Ethical artificial intelligence

“AI will simplify company work flows. People will find the information they are looking for more quickly and spend less time sorting or categorising data. We will develop this technology in our own way, first and foremost by protecting privacy and confidentiality. We will always ensure that AI works for the users and not for other purposes.”
Marc Oehler, CEO of Infomaniak

Activation is optional. You keep control.

We use so-called “weak” artificial intelligence, as with all systems currently in existence. That means that this AI is not aware of itself, does not have any sensitivity and even less so its own mind. It is not, therefore, Terminator 😉

It is an automatic learning system operated by an artificial network of neurones: deep learning. This technology give computers the ability to learn. The machine uses this to analyse data in order to take a decision in line with the situations encountered. It becomes capable, for example, of isolating shapes and faces, distinguishing typed characters, identifying signals on the stock markets, detecting fraud and even contributing to medical diagnoses. At Infomaniak, this network of neurones is managed internally and is entirely contained within our infrastructure in Switzerland.

The learning is initially developed via public data

To train itself and provide pertinent results, our AI is dependent on two things:

the questions we ask it
the data it uses to learn

We implement a network of neurones then give it problems to solve using public data. It involves nothing more than mathematics and statistics that we could calculate manually. AI simply enables these calculations to be repeated more quickly. This is what makes it interesting.

How exactly does it work?

The model learns to distinguish types of content using freely accessible sets of public data.

We provide it with rules and basic characteristics to enable it to train itself to find the best possible combination between them in order to distinguish holiday photos from photos of documents, photos of bills from ID photos, etc.

This is how it works:

First: the initial analysis and training phase

To detect the documents, our model observes the pixels to produce a set of rules which will represent characteristics such as:

a rate of white pixels in excess of 80%
the absence of people in the image
regular margins
a square or rectangular shape marked in the photo
etc.

At the end of this initial training, the algorithm is capable of ignoring private photos to direct itself spontaneously towards photos likely to be documents in order to categorise them subsequently using the labels.

Second: the improvement and reinforcement phase

Determining and labelling files with the right labels is only half the job. The algorithm must then learn from the user who will necessarily change the labels so that they correspond more precisely to their files. This is the case, for example, when a user changes a large number of labels. This so-called implicit user feedback involves taking the user’s actions into account in order to improve the relevance of the algorithm so as to avoid making the same mistakes.

These steps are repeated continuously in order to feed the algorithm, enhance the learning process and increase the relevance. By adjusting the weight and threshold of the different characteristics, the AI learns to recognise certain files more accurately in order to assign the correct label. In the long term, it no longer makes any mistakes.

An individual AI, isolated from other users

No spillover possible. For personal feedback, we provide for each user being able to train, so to speak, “their own AI”. No data can be leaked to another user. Your “honeymoon” label will remain your label. The same is true for a personalised label such as “after nightclub”. No one other than you will be aware of these.

Sometimes, we need to help the machine

If a user removes 90% of the labels from their bills, for example, this is probably because our algorithm made mistakes. In complex situations, an AI specialist at Infomaniak may need to check a file or series of files manually in the event of an error. The AI may be unable to resolve the problem posed and thus find the right label. If this problem cannot be solved, if – and only if – it is a document, the machine requires help to be able to move forwards and correct its mistake.

Files with redacted personal data and metadata

Before being subjected to a manual check, all information making it possible to trace the owner must be redacted from a document.

The machine removes the link with the account from which it comes, deletes the metadata (location, time, date, device, etc.) and any words that are too specific such as first names, nicknames, etc. Once it is longer possible to identify who a file belongs to, the manual check can be started to ensure the algorithm is on the right track. In this way, we ensure that the AI always provides you with satisfactory results while respecting user privacy as far as possible.

Our commitment to ethical AI

Our vision of artificial intelligence is clear: the aim is to increase the productivity of our users without sacrificing privacy or confidentiality.

Your data only belong to you
Our AI technology is independent
It is developed in-house in Switzerland
It involves no external service provider
It is disconnected from other users
Everything is hosted in Switzerland in our own infrastructures
Activation of AI is optional and deactivated by default
The information is used solely to improve your services and in no event for marketing purposes
Manual inspection of files is exceptional and does not make it possible to link a piece of data to its owner
The data are irreversibly anonymised (differential privacy)

These actions are currently under development. It must nevertheless be kept in mind that these are our intentions in the field of AI and that we cannot forecast a schedule too precisely or confirm such and such a function with compete certainty. This said, we felt it was important to share our vision of artificial intelligence with you and we will continue to keep you informed of the progress made via the blog and our social networks.

Infomaniak is developing its AI to improve corporate productivity

Work more quickly with kDrive and webmail

Ethical artificial intelligence

Activation is optional. You keep control.

The learning is initially developed via public data

How exactly does it work?

First: the initial analysis and training phase

Second: the improvement and reinforcement phase

An individual AI, isolated from other users

Sometimes, we need to help the machine

Files with redacted personal data and metadata

Our commitment to ethical AI

Not yet at Infomaniak?

Join our movement for a more ethical cloud

On the same subject

Euria: the free, sovereign AI assistant to no longer depend on the American giants

Email encryption is available for all Infomaniak email addresses

Infomaniak launches a managed Kubernetes service in its sovereign Public Cloud