We've just added support for Portuguese, Russian and Dutch email splitting and signature extraction to both the SigParser API and the SigParser application. The one feature still missing for these new languages is inline phone number detection.
SigParser now supports 7 languages overall: Dutch, English, French, German, Portuguese, Russian and Spanish.
Other Email Parsing Improvements
- Improved the number of titles SigParser can discover across all languages.
- Fixed incorrect email header detection in some scenarios. For example, if a line in an email says "From: email@example.com", SigParser won't split the email at that line unless other factors indicate that it really is an email split and not just part of the message text.
- Better handling of some legacy emails generated by old clients from around 2008.
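To illustrate the header detection fix above, here is a minimal sketch of the general idea, not SigParser's actual implementation. It assumes that the "other factors" are nearby header lines such as "Sent:", "To:" or "Subject:". The patterns, the function name `find_split_index`, and the `window` and `min_supporting` parameters are all hypothetical.

```python
import re

# Hypothetical patterns for illustration; SigParser's real rules are not shown in this post.
FROM_LINE = re.compile(r"^\s*From:\s*\S+", re.IGNORECASE)
OTHER_HEADER = re.compile(r"^\s*(Sent|Date|To|Subject|Cc):\s*\S+", re.IGNORECASE)


def find_split_index(lines, window=4, min_supporting=2):
    """Return the index of the first "From:" line that is backed up by
    other header lines shortly after it, or None if there is no such line."""
    for i, line in enumerate(lines):
        if not FROM_LINE.match(line):
            continue
        # Look at the next few lines for supporting headers (assumed heuristic).
        following = lines[i + 1 : i + 1 + window]
        supporting = sum(1 for nxt in following if OTHER_HEADER.match(nxt))
        if supporting >= min_supporting:
            return i
    return None


# A lone "From:" line in the body is not treated as a split.
plain_body = ["Hi,", "From: email@example.com", "is the address to use.", "Thanks"]

# A "From:" line followed by other headers is treated as a split.
forwarded = [
    "Sounds good.",
    "",
    "From: Jane <jane@example.com>",
    "Sent: Monday, May 4",
    "To: Bob",
    "Subject: Re: Plan",
    "",
    "Original message text",
]
```

With these inputs, `find_split_index(plain_body)` finds no split, while `find_split_index(forwarded)` points at the "From: Jane" line.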
Ignore Contacts on Large Meetings
When a meeting has more than 20 attendees, we don't create contact records for any of those attendees. For example, if you were invited to a party with 100 people, those probably aren't valuable contacts in isolation. As a result, we don't import those contacts (email addresses) into SigParser. If you later have an email with one or a few of them, SigParser will import them at that time.
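The rule can be sketched roughly as follows. This is an illustration only: the threshold of 20 comes from the description above, while the function name `contacts_to_import` and the choice to de-duplicate and lowercase addresses before counting are assumptions.

```python
MAX_ATTENDEES_FOR_IMPORT = 20  # threshold stated above


def contacts_to_import(attendee_emails):
    """Return the set of attendee addresses to import for one meeting.

    Meetings with more than MAX_ATTENDEES_FOR_IMPORT attendees yield nothing.
    De-duplicating and lowercasing first is an assumption made for this sketch.
    """
    unique = {e.strip().lower() for e in attendee_emails if e.strip()}
    if len(unique) > MAX_ATTENDEES_FOR_IMPORT:
        return set()
    return unique


small_meeting = ["Ann@example.com", "bob@example.com", "cy@example.com"]
big_party = [f"guest{i}@example.com" for i in range(100)]
```

Here `contacts_to_import(small_meeting)` returns all three addresses, and `contacts_to_import(big_party)` returns an empty set.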
Google Calendar Syncs More Data
Previously, SigParser would sync Google Calendar events for the last six months. Going forward, SigParser will sync events based on the max history date for the mailbox, which is the same date used for pulling emails. This means that if you purchase 2 years of history, SigParser will examine 2 years of calendar events.
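As a rough sketch of how such a window could be computed, assuming the max history date is simply today's date minus the purchased number of years: the function names `max_history_date` and `events_in_window` are hypothetical and are not part of the SigParser API.

```python
from datetime import date


def max_history_date(today, history_years):
    """Earliest date to sync from, assuming history is counted in whole years."""
    try:
        return today.replace(year=today.year - history_years)
    except ValueError:
        # Feb 29 with a non-leap target year: fall back to Feb 28.
        return today.replace(year=today.year - history_years, day=28)


def events_in_window(event_dates, today, history_years):
    """Keep only the events that fall between the max history date and today."""
    start = max_history_date(today, history_years)
    return [d for d in event_dates if start <= d <= today]


today = date(2024, 6, 15)
events = [date(2021, 12, 1), date(2022, 6, 15), date(2023, 1, 10), date(2024, 6, 1)]
recent = events_in_window(events, today, history_years=2)
```

With 2 years of history, the event from December 2021 is left out and the other three are kept.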