System development company in Tachikawa, Tokyo [CONPHIC Co., Ltd.]

Data conversion

Server Based Converter

Systems Development Corporation in Tachikawa, Tokyo CONPHIC Co., Ltd. Thank you for visiting our blog. When you work with document data, you sometimes need to convert a file to another format. You can do this by installing software on your own server, but CONPHIC Co., Ltd. can also provide an environment in which files are converted on a server and downloaded on request, using Server Based Converter from Antenna House, for which we are a reseller. We can also develop systems to your requirements, such as document management systems and document release systems. We use the existing data and contribute to …

Creating Digital Data of Vertical Text

CONPHIC Co., Ltd. has built many databases for various types of documents and has digitized a large volume of those documents. Japanese text is written vertically in most paper-based documents. When we digitize these documents, we use OCR software suited to Japanese text. When we digitize vertical text, we need to pay attention to repetition marks and to lines of two double-byte characters, which do not occur in horizontal writing. In addition, when we make digital text of old books or books including a lot …

Digital Book

The engineers at CONPHIC Co., Ltd. have a great deal of experience in creating data for digital books. Although it is only recently that books have come to be published in digital form, we created digital (text) data for a large number of out-of-copyright books about 20 years ago. We cut off the bindings, scan and OCR the pages, proofread the text and produce HTML or XML data. Aozora Bunko (青空文庫) holds data for many books, and the data we created is stored there. By making digital data of books, we can use functions like full-text search and extraction …

Data Aggregation

It is important to consider how to present the data collected in a database. If you have a customer information database, a document database and a questionnaire database, you can increase the utilization of both the data and the system by aggregating these data properly and presenting the necessary information in a suitable form. Beyond building a database just to store data, it is necessary to collect data with a clear purpose. CONPHIC Co., Ltd. collects data, converts it as preprocessing and builds a database. We design and build an appropriate system aggregating necessary information from …

AH Formatter

CONPHIC Co., Ltd. can perform automated formatting of multilingual documents with AH Formatter, developed by Antenna House, Inc. We create PDF files with the most appropriate layout from XML documents such as books, ledger sheets, manuals and dictionaries. We design an efficient layout for your intended use, such as PDF for printing or PDF for posting on the Web. We lay out documents with XSL-FO. Our engineers have a great deal of experience in automated formatting of XML documents with AH Formatter. There are cases where we conduct formatting in stand-alone mode, but we can also build a system to …

Needs for Multiple Languages

Recently I feel a growing need to make many kinds of Websites and pamphlets multilingual for visitors and international students from other countries. The Tokyo Metropolitan Government opened a "Website for preparation of multilingual menus" to welcome tourists from other countries. On this Website, they provide menu translations and touch-and-talk sheets free of charge. I think there will be a growing need to make many types of information multilingual, not only tourism information, in preparation for the Olympics in 2020. In addition to the development of multilingual Websites, CONPHIC Co., Ltd. …

Information Assets

After creating XML data and building a database of big data, we are sometimes asked how the XML data should be used. Ideally a customer has a clear purpose for computerizing the data, but there are cases where customers simply assume that something will work out if they use XML data. When CONPHIC Co., Ltd. interviews a customer about computerization or a database, we propose a data format based on the type and characteristics of the document. It may be a fact that there are recently many cases where customers create a versatile XML data …

Release of Text Conversion Tool!

Before text processing, deleting unnecessary new-line and space characters and making the writing style consistent, for example in the use of one-byte or two-byte characters, requires knowledge of regular expressions and programming. We have released a text conversion tool that makes it easy to format Japanese text without this knowledge. Anybody can use it for free. You can use it, for example, to reuse documents by deleting unnecessary new-line characters from text copied out of PDF files, or to unify the format of documents written by several people. We will introduce the use cases in our …
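As a rough illustration of the kind of cleanup described above, here is a minimal Python sketch. It is not the released tool: the function name `normalize_japanese_text` and the character ranges treated as "Japanese" are assumptions made for this example. It unifies one-byte and two-byte forms with Unicode NFKC normalization and removes new-line and space characters only where they sit between two Japanese characters, so that English text is left alone.

```python
import re
import unicodedata

# Assumed range for "Japanese" characters: CJK punctuation, hiragana,
# katakana, and the main block of CJK ideographs.
_JA = r"\u3001-\u30FF\u4E00-\u9FFF"


def normalize_japanese_text(text: str) -> str:
    """Unify character widths and drop stray breaks inside Japanese text."""
    # NFKC turns full-width letters and digits into half-width ones,
    # half-width katakana into full-width katakana, and the ideographic
    # space (U+3000) into an ordinary space.
    text = unicodedata.normalize("NFKC", text)
    # Remove a line break (with any spaces around it) between two
    # Japanese characters, as often appears in text copied from a PDF.
    text = re.sub(rf"(?<=[{_JA}])[ \t]*\n[ \t]*(?=[{_JA}])", "", text)
    # Remove spaces or tabs between two Japanese characters.
    text = re.sub(rf"(?<=[{_JA}])[ \t]+(?=[{_JA}])", "", text)
    return text


if __name__ == "__main__":
    print(normalize_japanese_text("これは　テスト\nです。ＡＢＣ１２３ ｶﾀｶﾅ"))
```

Note that NFKC also rewrites some compatibility characters (for example, enclosed or squared symbols), so a real tool would probably let the user choose which conversions to apply.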

Data Processing and Corpus Creation

CONPHIC Co., Ltd. often receives orders for corpus creation as part of its business in big data for documents. (Corpus: Wikipedia) We create basic data for natural language processing and research, such as corpora with structured information and bilingual corpora. At the National Institute for Japanese Language and Linguistics in Tachikawa, the Center for Corpus Development creates many corpora and releases them free of charge. I recommend using them. CONPHIC Co., Ltd. offers many tools and systems to improve the efficiency and quality of document creation work. Please contact CONPHIC …

Morphological Analysis

Unlike English and Korean, Japanese sentences do not have spaces between words (word separation). So we use a morphological analysis tool to extract words from Japanese sentences. CONPHIC Co., Ltd. uses MeCab as its morphological analysis tool. We customize the dictionary for morphological analysis depending on the documents. We categorize document types based on the terms in the sentences, and we develop proofreading tools that keep the terms in your documents consistent by checking how frequently each term appears. You can find the characteristics of the documents by analyzing the extracted words. …
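To give a feel for a frequency-based consistency check like the one mentioned above, here is a minimal Python sketch. It is an illustration under stated assumptions, not our proofreading tool: the function names `variant_key` and `find_inconsistent_terms` are hypothetical, and the rule for grouping variants (NFKC normalization plus dropping a trailing long-vowel mark "ー") is just one simple example of a rule. The input is a list of words that is assumed to have already been produced by a morphological analyzer, for instance MeCab's space-separated (wakati) output.

```python
from collections import Counter, defaultdict
import unicodedata


def variant_key(term: str) -> str:
    """Hypothetical grouping key for spelling variants of the same term."""
    # Unify character widths, then ignore a trailing long-vowel mark so
    # that, e.g., "サーバ" and "サーバー" fall into the same group.
    return unicodedata.normalize("NFKC", term).rstrip("ー")


def find_inconsistent_terms(tokens):
    """Map each majority spelling to its less frequent variants and counts."""
    counts = Counter(tokens)
    groups = defaultdict(list)
    for term in counts:
        groups[variant_key(term)].append(term)

    report = {}
    for variants in groups.values():
        if len(variants) > 1:
            # Most frequent spelling first; ties are broken alphabetically.
            ranked = sorted(variants, key=lambda t: (-counts[t], t))
            report[ranked[0]] = [(t, counts[t]) for t in ranked[1:]]
    return report


if __name__ == "__main__":
    # Tokens assumed to come from a morphological analyzer.
    tokens = ["サーバー", "を", "起動", "サーバ", "を", "停止", "サーバー", "ユーザー"]
    print(find_inconsistent_terms(tokens))
```

In this sketch the most frequent spelling in each group is treated as the preferred one, and the others are reported as candidates for correction. Which spelling is actually correct depends on the style rules of the document.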

©2014-2024 CONPHIC Co., Ltd.