SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Downloaden Sie, um offline zu lesen
© 2015 Mirai Translate, Inc. All rights reserved.
Mirai Translate, Inc.
1
Impossible only means
that you have still screwed up the solution.
-Mick Etoh
© 2015 Mirai Translate, Inc. All rights reserved.
2
Number of Inbound Visitors
in 2014
13,413,567
JPY2,030,500,000,000
EUR15,522,200,000
© 2015 Mirai Translate, Inc. All rights reserved.
Translation Total
Addressable Market (2014)
3
USD  2.1B
MT market    
USD  10M
© 2015 Mirai Translate, Inc. All rights reserved.
Unforeseen Challenges Ahead
4
Translation Speed (1/cost)
Quality
LSP Solutions
IP
Publication
Reports
CAT
Speech
Translater
Google
Translate
Web
Crowd Sourcing
Solutions
SOHO SOHO
MT+Post Editing Solutions
MT Real Time Solutions
Unforeseen New Market Frontier
性能向上による
新領域
© 2015 Mirai Translate, Inc. All rights reserved.
72% of Japanese don t speak English.
5
© 2015 Mirai Translate, Inc. All rights reserved.
6
Vision
To realize a society in which everyone can interact freely across language
barriers with the use of machine translation technology, and thereby
contribute to invigoration and innovation in businesses.
Mirai Translate, Inc.
© 2015 Mirai Translate, Inc. All rights reserved.
Mirai  Translate  as  Joint  Venture
7
Mobile  Platform  Leader ASR  &  MT  Solution  Provider Multilingual  Enterprise  MT  developer
NLP  and  MT  technology  leader Multilingual  SMT  technology  leader
Technology  
Transfer
© 2015 Mirai Translate, Inc. All rights reserved.
8
Our Competence
• Multiple Translation Engines
from Systran and NICT
• MT Training Tools from Systran
• NLP Tools
Named Entity Extraction, Pre-Ordering,…
• NL Data Assets
Corpus from Systran and NTT DOCOMO+ JPN
Ontology Dictionary
• Strong Technical Team
Experiences in AWS, Data Mining, MT
toward our own original MT systems.
© 2015 Mirai Translate, Inc. All rights reserved.
Siri
9
Big-Data, Big-Server, and Fat-Pipe Solution
© 2015 Mirai Translate, Inc. All rights reserved.
Shabette-Concier Voice agent service
• Launched Mar. 1, 2012
• Over 40 services in it
• Including chatting
• 10 million users
Shabette
Voice
=
Concier
Concierge
=
How may I help you?
10
© 2015 Mirai Translate, Inc. All rights reserved.
Touch the Concier.“Tell me how to make a pizza.”View a list of recipes of pizza.You can check a detailed recipe of pizza.“Tell me Italian restaurants nearby.”View a list of Italian restaurants.You can check detailed information of restaurants.11
© 2015 Mirai Translate, Inc. All rights reserved.Touch the Concier.Q: “What is the height of Mt. Fuji?”A: “3,766m!”Q: “When is holding schedule of the Tokyo Olympic Games?”A: “It will hold in 2020.” 12
© 2015 Mirai Translate, Inc. All rights reserved.
Basic Architecture 2010
Logging
Fuetrek Voice

Recognition
DOCOMO Task

Recognition
Logging
Voice
text
text contents
Service
Providers’ DB
contents
text
Text to speech
13
Fat-Pipe
Big-Servers
© 2015 Mirai Translate, Inc. All rights reserved.
Mirai Architecture 2015
Logging
Fuetrek Voice

Recognition
Mirai MT
Engines
Logging
Voice
text
text contents
Client Dictionary
Corpus DB
contents
text
Text to speech
14
© 2015 Mirai Translate, Inc. All rights reserved.
15
© 2015 Mirai Translate, Inc. All rights reserved.
We are Cloud Natives
16
システム構成部品
who believe our cloud
solution is scalable and safer!
© 2015 Mirai Translate, Inc. All rights reserved.
Bilingual	
  User	
  
Dictionaries
SYSnitionTRAN	
  7	
  HYBRID	
  ENGINE
SYSTRAN  Hybrid  Architecture
17
Source
Transl
ation
Main	
  Dictionaries	
  
Linguistic	
  Rules
User	
  Entities
Rules-­‐Based	
  
MT
Statistical	
  
Post-­‐
Edition
SBS BS
Target	
  
Monolingual	
  
Corpus
Source	
  
Adaptation
BS
Monolingual	
  
Source	
  Corpus
Bilingual	
  Corpus	
  or	
  
Translation	
  Memories
Bilingual	
  
Translation	
  
Models
Target	
  
Language	
  
Models
Source	
  
Language	
  
Models
Self-­‐training
Source	
  
Normalization	
  
Dictionaries
Self-­‐Training
Self-­‐Training
SBS
Statistical	
  MT
Translation	
  
Memories
Bilingual	
  
Terminology	
  
Extraction
	
  Spell	
  Check
Homographs
Target	
  
Normalization	
  
Dictionaries
Translation	
  	
  
Memories
Pre-Filter Formating
Normalization
Segmentation
Entity
Recognition
Translation Memory
User Dictionary Match
Post-Processing
Formatting
Normalization
Post-Filter
a Commercial
SMT Engine
© 2015 Mirai Translate, Inc. All rights reserved.
NTT  Technology  for  JPN  <->  EN
18
He saw a cat a long tail
this	
  is	
  Keiko	
  Tanaka	
  .	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  this	
  _va0	
  Keiko	
  Tanaka	
  is	
  .	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  田中 恵子 と 申し ます	
  
i	
  used	
  to	
  jog	
  every	
  morning	
  .	
  	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  i	
  _va0	
  every	
  morning	
  jog	
  to	
  used	
  .	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  毎朝 ジョギング し た もの です 。
she	
  was	
  wearing	
  a	
  sweater	
  and	
  high	
  heals	
  .	
   	
  	
  	
  	
  	
  	
  she	
  _va0	
  sweater	
  and	
  high	
  heals	
  _va2	
  wearing	
  was	
  .	
  
セーター を 着 て 、 ハイヒール を はい て い まし た 。
with sawcatwithlong tailが をHe
Post-Positional Particles
© 2015 Mirai Translate, Inc. All rights reserved.
Commerce
Patent Application
Finance
Corpus is the king,
19
Not only Size(Coverage)
but also Fitness.
Written Language Corpus Variation
Spoken
Language
Corpus
Variation Generic
Corpus
Travel
Public Patents
Ideal Corpus Data
but it must be decent and well-structured.
© 2015 Mirai Translate, Inc. All rights reserved.
20
SYSTRAN Training Server ‒ Main components
• Corpus Manager
• Mono/bilingual corpus
• Txt, html, doc, docx, rtf, xlsx, pptx, pdf, tmx
• Virtual file management (aggregation, split)
• Content Management Database (TU : Translation Units)
• Training Manager
• Baseline Evaluation (Quality metrics: GTM, BLEU, TER)
• Hybrid Model Training (SPE : Statistical Post-Edition)
• Statistical Model Training (SMT : Statistical Machine
Translation)
• Dictionary creation (UD) with bilingual terminology extraction
• Dictionary validation (UD) against a bilingual corpus (TMX)
• Translation Memory creation (TM) with document aligner
© 2015 Mirai Translate, Inc. All rights reserved.
Training Methodology
21
Collect	
  Data Run	
  Training Evaluate
Publish	
  to	
  Pilot/
Production	
  	
  
• Collect training data
• Define the domain
• Collect bilingual corpus (translation memories, documents and translations)
• Collect monolingual corpus (text, content relevant to the domain)
• Collect terminology if any (bilingual dictionaries, glossaries)
• Run initial training
• Evaluate
• Perform incremental cycles
© 2015 Mirai Translate, Inc. All rights reserved.
22
V.S.
© 2015 Mirai Translate, Inc. All rights reserved.
• Collaboration Tools
• Intranet Translation Portal	
  
• Web & Mobile Apps	
  
• Customer Service Portal
• Market Intelligence	
  
• Cyber-security	
  
• Forensic & eDiscovery Apps	
  
• Text Mining & Analytics
• Multilingual Web Site	
  
• Technical Translation Project	
  
• Translation Workflow Integration
Help and secure
information
communication
Detect critical information
within large scale foreign
data
Reduce costs and
timelines for translation
projects
Business
cases
Usages &
Applications
Customers
Translation Agencies &
Corporations
Defense & Securities &
Legal Organizations
Corporations & Public
Organizations
Localization
Multilingual
Communication
Big Data by HPC
Our Business Targets
• 3 main markets
23
© 2015 Mirai Translate, Inc. All rights reserved.
24
Multilingual  MT  
JP,  EN,  CN,  KR  +ASEAN
Enterprise  
Solutions
Consumer  
Services
We  are  an  engineering  company…
MT  APIs  
TMS
© 2015 Mirai Translate, Inc. All rights reserved.
25
It always seems impossible until it s done. - Nelson Mandela
As part of the Tomorrow television series 

produced by CBS for MIT's Centennial in 1961
© 2015 Mirai Translate, Inc. All rights reserved.
Their dreams
are coming true.
Mirai Translate, Inc.26
@mickbean

Weitere ähnliche Inhalte

Andere mochten auch

Drilling Down Into DNS DDoS
Drilling Down Into DNS DDoSDrilling Down Into DNS DDoS
Drilling Down Into DNS DDoSAPNIC
 
Avoiding dns amplification attacks
Avoiding dns amplification attacksAvoiding dns amplification attacks
Avoiding dns amplification attacksLucas Kauffman
 
Dns reflection attacks webinar slides
Dns reflection attacks webinar slidesDns reflection attacks webinar slides
Dns reflection attacks webinar slidesMen and Mice
 
Dns Amplification Zafiyeti
Dns Amplification ZafiyetiDns Amplification Zafiyeti
Dns Amplification ZafiyetiMehmet VAROL
 
The Other Advanced Attacks: DNS/NTP Amplification and Careto
The Other Advanced Attacks: DNS/NTP Amplification and CaretoThe Other Advanced Attacks: DNS/NTP Amplification and Careto
The Other Advanced Attacks: DNS/NTP Amplification and CaretoMike Chapple
 
Monitoring for DNS Security
Monitoring for DNS SecurityMonitoring for DNS Security
Monitoring for DNS SecurityThousandEyes
 
Finding Evil In DNS Traffic
Finding  Evil In DNS TrafficFinding  Evil In DNS Traffic
Finding Evil In DNS Trafficreal_slacker007
 
Security Onion Conference - 2016
Security Onion Conference - 2016Security Onion Conference - 2016
Security Onion Conference - 2016DefensiveDepth
 
MIRAI: What is It, How Does it Work and Why Should I Care?
MIRAI: What is It, How Does it Work and Why Should I Care?MIRAI: What is It, How Does it Work and Why Should I Care?
MIRAI: What is It, How Does it Work and Why Should I Care?Memoori
 
How IoT Is Breaking The Internet
How IoT Is Breaking The InternetHow IoT Is Breaking The Internet
How IoT Is Breaking The InternetCarl J. Levine
 
State of the Internet: Mirai, IOT and History of Botnets
State of the Internet: Mirai, IOT and History of BotnetsState of the Internet: Mirai, IOT and History of Botnets
State of the Internet: Mirai, IOT and History of BotnetsRahul Neel Mani
 
DNS Security
DNS SecurityDNS Security
DNS Securityinbroker
 
Dns security overview
Dns security overviewDns security overview
Dns security overviewVladimir2003
 
IoT - the Next Wave of DDoS Threat Landscape
IoT - the Next Wave of DDoS Threat LandscapeIoT - the Next Wave of DDoS Threat Landscape
IoT - the Next Wave of DDoS Threat LandscapeAPNIC
 
CNIT 40: 1: The Importance of DNS Security
CNIT 40: 1: The Importance of DNS SecurityCNIT 40: 1: The Importance of DNS Security
CNIT 40: 1: The Importance of DNS SecuritySam Bowne
 
(SEC306) Defending Against DDoS Attacks
(SEC306) Defending Against DDoS Attacks(SEC306) Defending Against DDoS Attacks
(SEC306) Defending Against DDoS AttacksAmazon Web Services
 
DNS Security Presentation ISSA
DNS Security Presentation ISSADNS Security Presentation ISSA
DNS Security Presentation ISSASrikrupa Srivatsan
 

Andere mochten auch (20)

Drilling Down Into DNS DDoS
Drilling Down Into DNS DDoSDrilling Down Into DNS DDoS
Drilling Down Into DNS DDoS
 
Avoiding dns amplification attacks
Avoiding dns amplification attacksAvoiding dns amplification attacks
Avoiding dns amplification attacks
 
Dns reflection attacks webinar slides
Dns reflection attacks webinar slidesDns reflection attacks webinar slides
Dns reflection attacks webinar slides
 
Dns Amplification Zafiyeti
Dns Amplification ZafiyetiDns Amplification Zafiyeti
Dns Amplification Zafiyeti
 
The Other Advanced Attacks: DNS/NTP Amplification and Careto
The Other Advanced Attacks: DNS/NTP Amplification and CaretoThe Other Advanced Attacks: DNS/NTP Amplification and Careto
The Other Advanced Attacks: DNS/NTP Amplification and Careto
 
Monitoring for DNS Security
Monitoring for DNS SecurityMonitoring for DNS Security
Monitoring for DNS Security
 
Finding Evil In DNS Traffic
Finding  Evil In DNS TrafficFinding  Evil In DNS Traffic
Finding Evil In DNS Traffic
 
Security Onion Conference - 2016
Security Onion Conference - 2016Security Onion Conference - 2016
Security Onion Conference - 2016
 
Dns tunnelling its all in the name
Dns tunnelling its all in the nameDns tunnelling its all in the name
Dns tunnelling its all in the name
 
MIRAI: What is It, How Does it Work and Why Should I Care?
MIRAI: What is It, How Does it Work and Why Should I Care?MIRAI: What is It, How Does it Work and Why Should I Care?
MIRAI: What is It, How Does it Work and Why Should I Care?
 
Advanced DNS Protection
Advanced DNS ProtectionAdvanced DNS Protection
Advanced DNS Protection
 
How IoT Is Breaking The Internet
How IoT Is Breaking The InternetHow IoT Is Breaking The Internet
How IoT Is Breaking The Internet
 
State of the Internet: Mirai, IOT and History of Botnets
State of the Internet: Mirai, IOT and History of BotnetsState of the Internet: Mirai, IOT and History of Botnets
State of the Internet: Mirai, IOT and History of Botnets
 
DNS Security
DNS SecurityDNS Security
DNS Security
 
Dns security overview
Dns security overviewDns security overview
Dns security overview
 
Security of DNS
Security of DNSSecurity of DNS
Security of DNS
 
IoT - the Next Wave of DDoS Threat Landscape
IoT - the Next Wave of DDoS Threat LandscapeIoT - the Next Wave of DDoS Threat Landscape
IoT - the Next Wave of DDoS Threat Landscape
 
CNIT 40: 1: The Importance of DNS Security
CNIT 40: 1: The Importance of DNS SecurityCNIT 40: 1: The Importance of DNS Security
CNIT 40: 1: The Importance of DNS Security
 
(SEC306) Defending Against DDoS Attacks
(SEC306) Defending Against DDoS Attacks(SEC306) Defending Against DDoS Attacks
(SEC306) Defending Against DDoS Attacks
 
DNS Security Presentation ISSA
DNS Security Presentation ISSADNS Security Presentation ISSA
DNS Security Presentation ISSA
 

Ähnlich wie Introduction of Mirai Translate, Inc.

An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013Welocalize
 
Unlock the Power of Machine Translation
Unlock the Power of Machine TranslationUnlock the Power of Machine Translation
Unlock the Power of Machine TranslationRDC
 
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...TAUS - The Language Data Network
 
Company and sevices overview
Company and sevices overviewCompany and sevices overview
Company and sevices overviewAsher Abraham
 
Gala Webminar September 2013
Gala Webminar September 2013Gala Webminar September 2013
Gala Webminar September 2013pangeanic
 
ChatGPT-Revolutionizing Communication.pdf
ChatGPT-Revolutionizing Communication.pdfChatGPT-Revolutionizing Communication.pdf
ChatGPT-Revolutionizing Communication.pdfRahul Ghorpade
 
Impetech Corporate Presentation
Impetech Corporate PresentationImpetech Corporate Presentation
Impetech Corporate PresentationSatya Patri
 
Enterprise DevOps: Crossing the Great Divide with DevOps Training
Enterprise DevOps: Crossing the Great Divide with DevOps TrainingEnterprise DevOps: Crossing the Great Divide with DevOps Training
Enterprise DevOps: Crossing the Great Divide with DevOps TrainingITpreneurs
 
Extend the Reach of R to the Enterprise (for useR! 2013)
Extend the Reach of R to the Enterprise (for useR! 2013)Extend the Reach of R to the Enterprise (for useR! 2013)
Extend the Reach of R to the Enterprise (for useR! 2013)Lou Bajuk
 
Learn how marketers use APIs to automate their stack
Learn how marketers use APIs to automate their stackLearn how marketers use APIs to automate their stack
Learn how marketers use APIs to automate their stackAlex Ortiz
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Will They Blend? - Agile, TOGAF and Enterprise Architecture
Will They Blend? - Agile, TOGAF and Enterprise ArchitectureWill They Blend? - Agile, TOGAF and Enterprise Architecture
Will They Blend? - Agile, TOGAF and Enterprise ArchitectureITpreneurs
 
TERASOLUNA Framework on the Spring IO Platform
TERASOLUNA Framework on the Spring IO PlatformTERASOLUNA Framework on the Spring IO Platform
TERASOLUNA Framework on the Spring IO Platformapkiban
 
JVMCON Java in the 21st Century: are you thinking far enough ahead?
JVMCON Java in the 21st Century: are you thinking far enough ahead?JVMCON Java in the 21st Century: are you thinking far enough ahead?
JVMCON Java in the 21st Century: are you thinking far enough ahead?Steve Poole
 
MSA, TBD, DDD, TDD, BDD, WTF?
MSA, TBD, DDD, TDD, BDD, WTF?MSA, TBD, DDD, TDD, BDD, WTF?
MSA, TBD, DDD, TDD, BDD, WTF?Michael Lambert
 
Onnx at lf oss na 20200629 v5
Onnx at lf oss na 20200629 v5Onnx at lf oss na 20200629 v5
Onnx at lf oss na 20200629 v5ISSIP
 

Ähnlich wie Introduction of Mirai Translate, Inc. (20)

An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013
 
Unlock the Power of Machine Translation
Unlock the Power of Machine TranslationUnlock the Power of Machine Translation
Unlock the Power of Machine Translation
 
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
FIPOTranslations - Who Need Them and How LE technologies Can Help, Henry Wang...
 
Flitto Company Deck_1Q17
Flitto Company Deck_1Q17Flitto Company Deck_1Q17
Flitto Company Deck_1Q17
 
Company and sevices overview
Company and sevices overviewCompany and sevices overview
Company and sevices overview
 
Gala Webminar September 2013
Gala Webminar September 2013Gala Webminar September 2013
Gala Webminar September 2013
 
ChatGPT-Revolutionizing Communication.pdf
ChatGPT-Revolutionizing Communication.pdfChatGPT-Revolutionizing Communication.pdf
ChatGPT-Revolutionizing Communication.pdf
 
Impetech Corporate Presentation
Impetech Corporate PresentationImpetech Corporate Presentation
Impetech Corporate Presentation
 
Enterprise DevOps: Crossing the Great Divide with DevOps Training
Enterprise DevOps: Crossing the Great Divide with DevOps TrainingEnterprise DevOps: Crossing the Great Divide with DevOps Training
Enterprise DevOps: Crossing the Great Divide with DevOps Training
 
Extend the Reach of R to the Enterprise (for useR! 2013)
Extend the Reach of R to the Enterprise (for useR! 2013)Extend the Reach of R to the Enterprise (for useR! 2013)
Extend the Reach of R to the Enterprise (for useR! 2013)
 
Learn how marketers use APIs to automate their stack
Learn how marketers use APIs to automate their stackLearn how marketers use APIs to automate their stack
Learn how marketers use APIs to automate their stack
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Will They Blend? - Agile, TOGAF and Enterprise Architecture
Will They Blend? - Agile, TOGAF and Enterprise ArchitectureWill They Blend? - Agile, TOGAF and Enterprise Architecture
Will They Blend? - Agile, TOGAF and Enterprise Architecture
 
TERASOLUNA Framework on the Spring IO Platform
TERASOLUNA Framework on the Spring IO PlatformTERASOLUNA Framework on the Spring IO Platform
TERASOLUNA Framework on the Spring IO Platform
 
JVMCON Java in the 21st Century: are you thinking far enough ahead?
JVMCON Java in the 21st Century: are you thinking far enough ahead?JVMCON Java in the 21st Century: are you thinking far enough ahead?
JVMCON Java in the 21st Century: are you thinking far enough ahead?
 
Oracle Cloud Café IOT 12 avril 2016
Oracle Cloud Café IOT 12 avril 2016Oracle Cloud Café IOT 12 avril 2016
Oracle Cloud Café IOT 12 avril 2016
 
Oracle Cloud Café IoT 12-APR-2016
Oracle Cloud Café IoT 12-APR-2016Oracle Cloud Café IoT 12-APR-2016
Oracle Cloud Café IoT 12-APR-2016
 
IETM Level 4 Service Provider -Code and Pixels.pdf
IETM Level 4 Service Provider -Code and Pixels.pdfIETM Level 4 Service Provider -Code and Pixels.pdf
IETM Level 4 Service Provider -Code and Pixels.pdf
 
MSA, TBD, DDD, TDD, BDD, WTF?
MSA, TBD, DDD, TDD, BDD, WTF?MSA, TBD, DDD, TDD, BDD, WTF?
MSA, TBD, DDD, TDD, BDD, WTF?
 
Onnx at lf oss na 20200629 v5
Onnx at lf oss na 20200629 v5Onnx at lf oss na 20200629 v5
Onnx at lf oss na 20200629 v5
 

Mehr von Osaka University

Generative AI: Redefining Creativity and Transforming Corporate Landscape
Generative AI: Redefining Creativity and Transforming Corporate LandscapeGenerative AI: Redefining Creativity and Transforming Corporate Landscape
Generative AI: Redefining Creativity and Transforming Corporate LandscapeOsaka University
 
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)Osaka University
 
立教大学MBA:AIの最先端技術によるこれからの価値創造
立教大学MBA:AIの最先端技術によるこれからの価値創造立教大学MBA:AIの最先端技術によるこれからの価値創造
立教大学MBA:AIの最先端技術によるこれからの価値創造Osaka University
 
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのことOsaka University
 
デジタル資本主義と スマートリスクの取り方
デジタル資本主義とスマートリスクの取り方デジタル資本主義とスマートリスクの取り方
デジタル資本主義と スマートリスクの取り方Osaka University
 
DX 組織デザインパターン
DX 組織デザインパターンDX 組織デザインパターン
DX 組織デザインパターンOsaka University
 
To be or not to be an academic, big enterprise, startup job that is the qu...
  To be or not to be an academic, big enterprise, startup job  that is the qu...  To be or not to be an academic, big enterprise, startup job  that is the qu...
To be or not to be an academic, big enterprise, startup job that is the qu...Osaka University
 
身の丈にあった社会問題解決
身の丈にあった社会問題解決身の丈にあった社会問題解決
身の丈にあった社会問題解決Osaka University
 
AI系ディープテックスタートアップ の経営環境
AI系ディープテックスタートアップの経営環境AI系ディープテックスタートアップの経営環境
AI系ディープテックスタートアップ の経営環境Osaka University
 
AI_IoTを活用する企業のあり方
AI_IoTを活用する企業のあり方AI_IoTを活用する企業のあり方
AI_IoTを活用する企業のあり方Osaka University
 
AI とデジタル変革
AI とデジタル変革AI とデジタル変革
AI とデジタル変革Osaka University
 
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきこと
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきことデジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきこと
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきことOsaka University
 
デジタルが切り開く未来ビジネス
デジタルが切り開く未来ビジネスデジタルが切り開く未来ビジネス
デジタルが切り開く未来ビジネスOsaka University
 
鉄腕アトムはできるか?
鉄腕アトムはできるか?鉄腕アトムはできるか?
鉄腕アトムはできるか?Osaka University
 
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~Osaka University
 
Move out from your comfort zone!
Move out from your comfort zone!Move out from your comfort zone!
Move out from your comfort zone!Osaka University
 
クラウドの進化とメディア理解の発展
クラウドの進化とメディア理解の発展クラウドの進化とメディア理解の発展
クラウドの進化とメディア理解の発展Osaka University
 

Mehr von Osaka University (20)

CREST AIの振り返り
CREST AIの振り返りCREST AIの振り返り
CREST AIの振り返り
 
Generative AI: Redefining Creativity and Transforming Corporate Landscape
Generative AI: Redefining Creativity and Transforming Corporate LandscapeGenerative AI: Redefining Creativity and Transforming Corporate Landscape
Generative AI: Redefining Creativity and Transforming Corporate Landscape
 
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)
自然言語処理の発展がもたらす未来(電気通信協会調査会での講演)
 
立教大学MBA:AIの最先端技術によるこれからの価値創造
立教大学MBA:AIの最先端技術によるこれからの価値創造立教大学MBA:AIの最先端技術によるこれからの価値創造
立教大学MBA:AIの最先端技術によるこれからの価値創造
 
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと
龍野高校創立125周年記念講演:2030年までにやっておくべき3つのこと
 
デジタル資本主義と スマートリスクの取り方
デジタル資本主義とスマートリスクの取り方デジタル資本主義とスマートリスクの取り方
デジタル資本主義と スマートリスクの取り方
 
DX 組織デザインパターン
DX 組織デザインパターンDX 組織デザインパターン
DX 組織デザインパターン
 
To be or not to be an academic, big enterprise, startup job that is the qu...
  To be or not to be an academic, big enterprise, startup job  that is the qu...  To be or not to be an academic, big enterprise, startup job  that is the qu...
To be or not to be an academic, big enterprise, startup job that is the qu...
 
DX と社会問題解決
DX と社会問題解決DX と社会問題解決
DX と社会問題解決
 
身の丈にあった社会問題解決
身の丈にあった社会問題解決身の丈にあった社会問題解決
身の丈にあった社会問題解決
 
AI系ディープテックスタートアップ の経営環境
AI系ディープテックスタートアップの経営環境AI系ディープテックスタートアップの経営環境
AI系ディープテックスタートアップ の経営環境
 
AI_IoTを活用する企業のあり方
AI_IoTを活用する企業のあり方AI_IoTを活用する企業のあり方
AI_IoTを活用する企業のあり方
 
AI とデジタル変革
AI とデジタル変革AI とデジタル変革
AI とデジタル変革
 
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきこと
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきことデジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきこと
デジタル変革とソフトウェア化する産業:これからの20年に君たちが知っておくべきこと
 
デジタルが切り開く未来ビジネス
デジタルが切り開く未来ビジネスデジタルが切り開く未来ビジネス
デジタルが切り開く未来ビジネス
 
デジタル戦略とAWS
デジタル戦略とAWSデジタル戦略とAWS
デジタル戦略とAWS
 
鉄腕アトムはできるか?
鉄腕アトムはできるか?鉄腕アトムはできるか?
鉄腕アトムはできるか?
 
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~
経営視点から考察するデジタル戦略 ~クラウドがもたらすビジネスインパクト~
 
Move out from your comfort zone!
Move out from your comfort zone!Move out from your comfort zone!
Move out from your comfort zone!
 
クラウドの進化とメディア理解の発展
クラウドの進化とメディア理解の発展クラウドの進化とメディア理解の発展
クラウドの進化とメディア理解の発展
 

Kürzlich hochgeladen

System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingBootNeck1
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communicationpanditadesh123
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxStephen Sitton
 
Prach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism CommunityPrach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism Communityprachaibot
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating SystemRashmi Bhat
 
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdf
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdfPaper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdf
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdfNainaShrivastava14
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Coursebim.edu.pl
 
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.elesangwon
 
DEVICE DRIVERS AND INTERRUPTS SERVICE MECHANISM.pdf
DEVICE DRIVERS AND INTERRUPTS  SERVICE MECHANISM.pdfDEVICE DRIVERS AND INTERRUPTS  SERVICE MECHANISM.pdf
DEVICE DRIVERS AND INTERRUPTS SERVICE MECHANISM.pdfAkritiPradhan2
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfDrew Moseley
 
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmComputer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmDeepika Walanjkar
 
List of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfList of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfisabel213075
 
KCD Costa Rica 2024 - Nephio para parvulitos
KCD Costa Rica 2024 - Nephio para parvulitosKCD Costa Rica 2024 - Nephio para parvulitos
KCD Costa Rica 2024 - Nephio para parvulitosVictor Morales
 
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.ppt
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.pptROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.ppt
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.pptJohnWilliam111370
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating SystemRashmi Bhat
 
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSneha Padhiar
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxRomil Mishra
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewsandhya757531
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 

Kürzlich hochgeladen (20)

System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event Scheduling
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communication
 
Turn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptxTurn leadership mistakes into a better future.pptx
Turn leadership mistakes into a better future.pptx
 
Prach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism CommunityPrach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism Community
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating System
 
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdf
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdfPaper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdf
Paper Tube : Shigeru Ban projects and Case Study of Cardboard Cathedral .pdf
 
Katarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School CourseKatarzyna Lipka-Sidor - BIM School Course
Katarzyna Lipka-Sidor - BIM School Course
 
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.
2022 AWS DNA Hackathon 장애 대응 솔루션 jarvis.
 
DEVICE DRIVERS AND INTERRUPTS SERVICE MECHANISM.pdf
DEVICE DRIVERS AND INTERRUPTS  SERVICE MECHANISM.pdfDEVICE DRIVERS AND INTERRUPTS  SERVICE MECHANISM.pdf
DEVICE DRIVERS AND INTERRUPTS SERVICE MECHANISM.pdf
 
Immutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdfImmutable Image-Based Operating Systems - EW2024.pdf
Immutable Image-Based Operating Systems - EW2024.pdf
 
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmComputer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
 
List of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdfList of Accredited Concrete Batching Plant.pdf
List of Accredited Concrete Batching Plant.pdf
 
KCD Costa Rica 2024 - Nephio para parvulitos
KCD Costa Rica 2024 - Nephio para parvulitosKCD Costa Rica 2024 - Nephio para parvulitos
KCD Costa Rica 2024 - Nephio para parvulitos
 
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.ppt
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.pptROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.ppt
ROBOETHICS-CCS345 ETHICS AND ARTIFICIAL INTELLIGENCE.ppt
 
Main Memory Management in Operating System
Main Memory Management in Operating SystemMain Memory Management in Operating System
Main Memory Management in Operating System
 
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATIONSOFTWARE ESTIMATION COCOMO AND FP CALCULATION
SOFTWARE ESTIMATION COCOMO AND FP CALCULATION
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptx
 
Artificial Intelligence in Power System overview
Artificial Intelligence in Power System overviewArtificial Intelligence in Power System overview
Artificial Intelligence in Power System overview
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 

Introduction of Mirai Translate, Inc.

  • 1. © 2015 Mirai Translate, Inc. All rights reserved. Mirai Translate, Inc. 1 Impossible only means that you have still screwed up the solution. -Mick Etoh
  • 2. © 2015 Mirai Translate, Inc. All rights reserved. 2 Number of Inbound Visitors in 2014 13,413,567 JPY2,030,500,000,000 EUR15,522,200,000
  • 3. © 2015 Mirai Translate, Inc. All rights reserved. Translation Total Addressable Market (2014) 3 USD  2.1B MT market     USD  10M
  • 4. © 2015 Mirai Translate, Inc. All rights reserved. Unforeseen Challenges Ahead 4 Translation Speed (1/cost) Quality LSP Solutions IP Publication Reports CAT Speech Translater Google Translate Web Crowd Sourcing Solutions SOHO SOHO MT+Post Editing Solutions MT Real Time Solutions Unforeseen New Market Frontier 性能向上による 新領域
  • 5. © 2015 Mirai Translate, Inc. All rights reserved. 72% of Japanese don t speak English. 5
  • 6. © 2015 Mirai Translate, Inc. All rights reserved. 6 Vision To realize a society in which everyone can interact freely across language barriers with the use of machine translation technology, and thereby contribute to invigoration and innovation in businesses. Mirai Translate, Inc.
  • 7. © 2015 Mirai Translate, Inc. All rights reserved. Mirai  Translate  as  Joint  Venture 7 Mobile  Platform  Leader ASR  &  MT  Solution  Provider Multilingual  Enterprise  MT  developer NLP  and  MT  technology  leader Multilingual  SMT  technology  leader Technology   Transfer
  • 8. © 2015 Mirai Translate, Inc. All rights reserved. 8 Our Competence • Multiple Translation Engines from Systran and NICT • MT Training Tools from Systran • NLP Tools Named Entity Extraction, Pre-Ordering,… • NL Data Assets Corpus from Systran and NTT DOCOMO+ JPN Ontology Dictionary • Strong Technical Team Experiences in AWS, Data Mining, MT toward our own original MT systems.
  • 9. © 2015 Mirai Translate, Inc. All rights reserved. Siri 9 Big-Data, Big-Server, and Fat-Pipe Solution
  • 10. © 2015 Mirai Translate, Inc. All rights reserved. Shabette-Concier Voice agent service • Launched Mar. 1, 2012 • Over 40 services in it • Including chatting • 10 million users Shabette Voice = Concier Concierge = How may I help you? 10
  • 11. © 2015 Mirai Translate, Inc. All rights reserved. Touch the Concier.“Tell me how to make a pizza.”View a list of recipes of pizza.You can check a detailed recipe of pizza.“Tell me Italian restaurants nearby.”View a list of Italian restaurants.You can check detailed information of restaurants.11
  • 12. © 2015 Mirai Translate, Inc. All rights reserved.Touch the Concier.Q: “What is the height of Mt. Fuji?”A: “3,766m!”Q: “When is holding schedule of the Tokyo Olympic Games?”A: “It will hold in 2020.” 12
  • 13. © 2015 Mirai Translate, Inc. All rights reserved. Basic Architecture 2010 Logging Fuetrek Voice
 Recognition DOCOMO Task
 Recognition Logging Voice text text contents Service Providers’ DB contents text Text to speech 13 Fat-Pipe Big-Servers
  • 14. © 2015 Mirai Translate, Inc. All rights reserved. Mirai Architecture 2015 Logging Fuetrek Voice
 Recognition Mirai MT Engines Logging Voice text text contents Client Dictionary Corpus DB contents text Text to speech 14
  • 15. © 2015 Mirai Translate, Inc. All rights reserved. 15
  • 16. © 2015 Mirai Translate, Inc. All rights reserved. We are Cloud Natives 16 システム構成部品 who believe our cloud solution is scalable and safer!
  • 17. © 2015 Mirai Translate, Inc. All rights reserved. Bilingual  User   Dictionaries SYSnitionTRAN  7  HYBRID  ENGINE SYSTRAN  Hybrid  Architecture 17 Source Transl ation Main  Dictionaries   Linguistic  Rules User  Entities Rules-­‐Based   MT Statistical   Post-­‐ Edition SBS BS Target   Monolingual   Corpus Source   Adaptation BS Monolingual   Source  Corpus Bilingual  Corpus  or   Translation  Memories Bilingual   Translation   Models Target   Language   Models Source   Language   Models Self-­‐training Source   Normalization   Dictionaries Self-­‐Training Self-­‐Training SBS Statistical  MT Translation   Memories Bilingual   Terminology   Extraction  Spell  Check Homographs Target   Normalization   Dictionaries Translation     Memories Pre-Filter Formating Normalization Segmentation Entity Recognition Translation Memory User Dictionary Match Post-Processing Formatting Normalization Post-Filter a Commercial SMT Engine
  • 18. © 2015 Mirai Translate, Inc. All rights reserved. NTT  Technology  for  JPN  <->  EN 18 He saw a cat a long tail this  is  Keiko  Tanaka  .                                                                                                            this  _va0  Keiko  Tanaka  is  .                                                                                                                          田中 恵子 と 申し ます   i  used  to  jog  every  morning  .                                              i  _va0  every  morning  jog  to  used  .                                                                                                                                                                                  毎朝 ジョギング し た もの です 。 she  was  wearing  a  sweater  and  high  heals  .              she  _va0  sweater  and  high  heals  _va2  wearing  was  .   セーター を 着 て 、 ハイヒール を はい て い まし た 。 with sawcatwithlong tailが をHe Post-Positional Particles
  • 19. © 2015 Mirai Translate, Inc. All rights reserved. Commerce Patent Application Finance Corpus is the king, 19 Not only Size(Coverage) but also Fitness. Written Language Corpus Variation Spoken Language Corpus Variation Generic Corpus Travel Public Patents Ideal Corpus Data but it must be decent and well-structured.
  • 20. © 2015 Mirai Translate, Inc. All rights reserved. 20 SYSTRAN Training Server ‒ Main components • Corpus Manager • Mono/bilingual corpus • Txt, html, doc, docx, rtf, xlsx, pptx, pdf, tmx • Virtual file management (aggregation, split) • Content Management Database (TU : Translation Units) • Training Manager • Baseline Evaluation (Quality metrics: GTM, BLEU, TER) • Hybrid Model Training (SPE : Statistical Post-Edition) • Statistical Model Training (SMT : Statistical Machine Translation) • Dictionary creation (UD) with bilingual terminology extraction • Dictionary validation (UD) against a bilingual corpus (TMX) • Translation Memory creation (TM) with document aligner
  • 21. © 2015 Mirai Translate, Inc. All rights reserved. Training Methodology 21 Collect  Data Run  Training Evaluate Publish  to  Pilot/ Production     • Collect training data • Define the domain • Collect bilingual corpus (translation memories, documents and translations) • Collect monolingual corpus (text, content relevant to the domain) • Collect terminology if any (bilingual dictionaries, glossaries) • Run initial training • Evaluate • Perform incremental cycles
  • 22. © 2015 Mirai Translate, Inc. All rights reserved. 22 V.S.
  • 23. © 2015 Mirai Translate, Inc. All rights reserved. • Collaboration Tools • Intranet Translation Portal   • Web & Mobile Apps   • Customer Service Portal • Market Intelligence   • Cyber-security   • Forensic & eDiscovery Apps   • Text Mining & Analytics • Multilingual Web Site   • Technical Translation Project   • Translation Workflow Integration Help and secure information communication Detect critical information within large scale foreign data Reduce costs and timelines for translation projects Business cases Usages & Applications Customers Translation Agencies & Corporations Defense & Securities & Legal Organizations Corporations & Public Organizations Localization Multilingual Communication Big Data by HPC Our Business Targets • 3 main markets 23
  • 24. © 2015 Mirai Translate, Inc. All rights reserved. 24 Multilingual  MT   JP,  EN,  CN,  KR  +ASEAN Enterprise   Solutions Consumer   Services We  are  an  engineering  company… MT  APIs   TMS
  • 25. © 2015 Mirai Translate, Inc. All rights reserved. 25 It always seems impossible until it s done. - Nelson Mandela As part of the Tomorrow television series produced by CBS for MIT's Centennial in 1961
  • 26. © 2015 Mirai Translate, Inc. All rights reserved. Their dreams are coming true. Mirai Translate, Inc.26 @mickbean