PDF] Arabic to French Sentence Alignment: Exploration of A Cross
Por um escritor misterioso
Last updated 15 junho 2024
A new approach to aligning sentences from a parallel corpus based on a cross-language information retrieval system is presented and it is shown that alignment has correct precision and recall even when the corpus is not completely parallel. Sentence alignment consists in estimating which sentence or sentences in the source language correspond with which sentence or sentences in a target language. We present in this paper a new approach to aligning sentences from a parallel corpus based on a cross-language information retrieval system. This approach consists in building a database of sentences of the target text and considering each sentence of the source text as a "query" to that database. The cross-language information retrieval system is a weighted Boolean search engine based on a deep linguistic analysis of the query and the documents to be indexed. This system is composed of a multilingual linguistic analyzer, a statistical analyzer, a reformulator, a comparator and a search engine. The multilingual linguistic analyzer includes a morphological analyzer, a part-of-speech tagger and a syntactic analyzer. The linguistic analyzer processes both documents to be indexed and queries to produce a set of normalized lemmas, a set of named entities and a set of nominal compounds with their morpho-syntactic tags. The statistical analyzer computes for documents to be indexed concept weights based on concept database frequencies. The comparator computes intersections between queries and documents and provides a relevance weight for each intersection. Before this comparison, the reformulator expands queries during the search. The expansion is used to infer from the original query words other words expressing the same concepts. The search engine retrieves the ranked, relevant documents from the indexes according to the corresponding reformulated query and then merges the results obtained for each language, taking into account the original words of the query and their weights in order to score the documents. The sentence aligner has been evaluated on the MD corpus of the ARCADE II project which is composed of news articles from the French newspaper "Le Monde Diplomatique". The part of the corpus used in evaluation consists of the same subset of sentences in Arabic and French. Arabic sentences are aligned to their French counterparts. Results showed that alignment has correct precision and recall even when the corpus is not completely parallel (changes in sentence order or missing sentences).
Align and distribute objects using rulers
Behind the Scenes: Exploring the Inner Workings of ChatGPT – Part 1
Kafalah by ISS/IRC - Issuu
Exploring Islamic Social Work. Between Community and the Common Good - Pluriel
Languages, Free Full-Text
All Eyes on Egypt: Islam and the Medical Use of Dead Bodies Amidst Cairo's Political Unrest: Medical Anthropology: Vol 35, No 3
Transforming knowledge for just and sustainable futures: International Conference to mark the 30th anniversary of the UNITWIN/UNESCO Chairs Programme; 3-4 November 2022, UNESCO Headquarters, Paris, France
Account Development Executive Resume Sample 2023
Arabic to French Sentence Alignment: Exploration of A Cross-language Information Retrieval Approach - ACL Anthology
Modelling individual and cross-cultural variation in the mapping of emotions to speech prosody
Recomendado para você
-
Fun in Fall - What's New in Room 102?15 junho 2024
-
Use Crosscheck In A Sentence15 junho 2024
-
Using the cross as a personal adornment has been en voguesince the15 junho 2024
-
Minimalist Education15 junho 2024
-
Definition and Use of Strikethrough15 junho 2024
-
Solved Use your understanding of planning to complete the15 junho 2024
-
Decodable Readers Multisyllables Open Syllables Books and Lesson15 junho 2024
-
Scrabble Quip Qubes Word cross sentence Board Game 198115 junho 2024
-
Check out the big brain on Mandy (another large language model test, details in comments) : r/replika15 junho 2024
-
7 Cross checking ideas teaching reading, reading strategies, first grade reading15 junho 2024
você pode gostar
-
Squid Game: The Challenge' Finale: An Addictive Abomination15 junho 2024
-
Filmes e séries baseados em games para assistir durante a quarentena15 junho 2024
-
Balanço das atividades: Robson Viana - Assembleia Legislativa de Sergipe15 junho 2024
-
The World's Hardest Game - Walkthrough Level 415 junho 2024
-
Massive Multiplayer Online Games (MMOs) - OSCAR15 junho 2024
-
capcut quote meme|TikTok Search15 junho 2024
-
CapCut_pp laylaylayla15 junho 2024
-
Evolution Series Fossil Pokemon Aerodactyl Set - Pokemon Resin Statue - PPAP Studios [In Stock]15 junho 2024
-
Pokemon Go egg chart: Every Pokemon you can hatch from Generation 215 junho 2024
-
ZombieCastBG #23 - Entrevista com o Z-Team by ZombieCastBG15 junho 2024