{"id":63311,"date":"2012-11-01T00:00:00","date_gmt":"2012-11-01T00:00:00","guid":{"rendered":"https:\/\/dataladder.com\/techniques-depuration-des-donnees-pour-les-redondances\/"},"modified":"2022-04-11T10:34:53","modified_gmt":"2022-04-11T10:34:53","slug":"techniques-depuration-des-donnees-pour-les-redondances","status":"publish","type":"post","link":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/","title":{"rendered":"Techniques d&rsquo;\u00e9puration des donn\u00e9es pour les redondances"},"content":{"rendered":"<p>Le traitement des donn\u00e9es dupliqu\u00e9es n\u00e9cessite une strat\u00e9gie pour traiter les donn\u00e9es incoh\u00e9rentes. La premi\u00e8re \u00e9tape consisterait \u00e0 normaliser les adresses \u00e0 l&rsquo;aide d&rsquo;un <a href=\"https:\/\/dataladder.com\/fr\/logiciel-de-correspondance-de-donnees-classe-parmi-les-meilleurs-de-sa-categorie-avec-une-precision-de-correspondance-de-96\/\">logiciel de rapprochement des donn\u00e9es<\/a>. Deuxi\u00e8mement, assurez-vous d&rsquo;utiliser des programmes de saisie de donn\u00e9es qui valident les formats des champs, afin d&rsquo;\u00e9viter les erreurs, comme la saisie de noms dans un champ de num\u00e9ro de t\u00e9l\u00e9phone. Il est essentiel de trouver tous les enregistrements qui contiennent exactement ou approximativement les m\u00eames donn\u00e9es dans un ou plusieurs champs. Examinez l&rsquo;\u00e9chantillon ci-dessous de cinq enregistrements contenant six champs dans chaque enregistrement :<br \/>\nNom Adresse Ville St ZIP\u00ae T\u00e9l\u00e9phone<br \/>\n&#8212;&#8212; &#8212;&#8212;&#8212;&#8212;&#8212;&#8211; &#8212;&#8212;&#8212;&#8212; &#8212; &#8212;&#8212;&#8212;- &#8212;&#8212;&#8212;&#8212;&#8211;<br \/>\n1 DAVIS 115 E 1ST ST CLEBURNE TX 76031-2407 (817) 458 9992<br \/>\n2 DAVIS 1 115 ST EAST CLEBURNE TX 76031<br \/>\n3 DAVIS 1 EAST 15TH CLEBURNE DR TX 817-458-9992<br \/>\n4 DAVIS 1 E FIFTEENTH ST CLEBURNE TX 76031 458-9992<br \/>\n5 DAVIS ONE EAST 15TH ST CLEBURNE TX 76031 817-458-9991<\/p>\n<p>Vous verrez que les cinq enregistrements ci-dessus concernent la m\u00eame personne \u00e0 la m\u00eame adresse ; il n&rsquo;y a pas deux enregistrements exactement identiques. Envisagez ensuite les tentatives possibles pour localiser les doublons dans le fichier :<br \/>\n<strong>BROWSE 1<\/strong>: S\u00e9lectionnez les enregistrements ayant le m\u00eame champ d&rsquo;adresse. Ne trouve aucun des documents susmentionn\u00e9s.<br \/>\n<strong>BROWSE 2<\/strong>: S\u00e9lectionnez les enregistrements ayant le m\u00eame nom et le m\u00eame code postal \u00e0 cinq chiffres. Manque les enregistrements 1, 3 et 5.<br \/>\n<strong>BROWSE 3<\/strong>: S\u00e9lectionnez les enregistrements portant le nom \u00ab\u00a0DAVIS\u00a0\u00bb. Manque les enregistrements 2 et 3 (tout en correspondant probablement \u00e0 beaucoup d&rsquo;autres DAVIS \u00e0 d&rsquo;autres adresses).<br \/>\nApr\u00e8s avoir effectu\u00e9 une correction d&rsquo;adresse et une validation sur le terrain, les \u00e9chantillons \u00e9num\u00e9r\u00e9s ci-dessus deviennent :<br \/>\nNom Adresse Ville St ZIP T\u00e9l\u00e9phone<br \/>\n&#8212;&#8211; &#8212;&#8212;&#8212;&#8211; &#8212;&#8212;- &#8212; &#8212;&#8212;&#8212;- &#8212;&#8212;&#8212;&#8212;<br \/>\n1 DAVIS 115 E<sup>1ST<\/sup> ST CLEBURNE TX 76031-2407 817-458-9992<br \/>\n2 DAVIS 115 E<sup>1ST<\/sup> ST CLEBURNE TX 76031-2407<br \/>\n3 DAVIS 115 E<sup>1ST<\/sup> ST CLEBURNE TX 76031-2407 817-458-9992<br \/>\n4 DAVIS 115 E<sup>1ST<\/sup> ST CLEBURNE TX 76031-2407 XXX-458-9992<br \/>\n5 DAVIS 115 E<sup>1ST<\/sup> ST CLEBURNE TX 76031-2407 817-458-9992<\/p>\n<p>Une fois la normalisation termin\u00e9e, les tentatives de d\u00e9tection des doublons seront grandement am\u00e9lior\u00e9es et auront plus de chances de trouver le bon groupe de doublons. En s\u00e9lectionnant \u00ab\u00a0les enregistrements ayant la m\u00eame adresse, le m\u00eame code postal et le m\u00eame nom soundex\u00a0\u00bb, on obtient un r\u00e9sultat parfait dans l&rsquo;exemple ci-dessus.<\/p>\n<p>Data Ladder est votre partenaire et votre expert en analyse pour vous aider \u00e0 r\u00e9soudre les probl\u00e8mes de redondance et de duplication. Nous pouvons apporter simplicit\u00e9 et clart\u00e9 \u00e0 un projet autrement embrouill\u00e9 et compliqu\u00e9. Ayez confiance que Data Ladder vous aidera \u00e0 r\u00e9soudre vos probl\u00e8mes de qualit\u00e9 de donn\u00e9es et \u00e0 am\u00e9liorer de fa\u00e7on mesurable la qualit\u00e9 et les performances financi\u00e8res. <a href=\"mailto:info@dataladder.com\">Contactez-nous<\/a> pour plus d&rsquo;informations et pour obtenir votre <a href=\"https:\/\/dataladder.com\/fr\/essai-gratuit-logiciel-de-comparaison-de-donnees\/\">essai gratuit.<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Le traitement des donn\u00e9es dupliqu\u00e9es n\u00e9cessite une strat\u00e9gie pour traiter les donn\u00e9es incoh\u00e9rentes. La premi\u00e8re \u00e9tape consisterait \u00e0 normaliser les adresses \u00e0 l&rsquo;aide d&rsquo;un logiciel de rapprochement des donn\u00e9es. Deuxi\u00e8mement, assurez-vous d&rsquo;utiliser des programmes de saisie de donn\u00e9es qui valident les formats des champs, afin d&rsquo;\u00e9viter les erreurs, comme la saisie de noms dans un [&hellip;]<\/p>\n","protected":false},"author":9,"featured_media":3221,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_lmt_disableupdate":"","_lmt_disable":"","_links_to":"","_links_to_target":""},"categories":[],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v19.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Techniques d&#039;\u00e9puration des donn\u00e9es pour les redondances - Data Ladder<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Techniques d&#039;\u00e9puration des donn\u00e9es pour les redondances - Data Ladder\" \/>\n<meta property=\"og:description\" content=\"Le traitement des donn\u00e9es dupliqu\u00e9es n\u00e9cessite une strat\u00e9gie pour traiter les donn\u00e9es incoh\u00e9rentes. La premi\u00e8re \u00e9tape consisterait \u00e0 normaliser les adresses \u00e0 l&rsquo;aide d&rsquo;un logiciel de rapprochement des donn\u00e9es. Deuxi\u00e8mement, assurez-vous d&rsquo;utiliser des programmes de saisie de donn\u00e9es qui valident les formats des champs, afin d&rsquo;\u00e9viter les erreurs, comme la saisie de noms dans un [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\" \/>\n<meta property=\"og:site_name\" content=\"Data Ladder\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/web.facebook.com\/DataLadderSoftware\" \/>\n<meta property=\"article:published_time\" content=\"2012-11-01T00:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2022-04-11T10:34:53+00:00\" \/>\n<meta name=\"author\" content=\"lbarrera\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"lbarrera\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\"},\"author\":{\"name\":\"lbarrera\",\"@id\":\"https:\/\/dataladder.com\/fr\/#\/schema\/person\/6cc3d6b3c83c611546541b5eb2d1e21b\"},\"headline\":\"Techniques d&rsquo;\u00e9puration des donn\u00e9es pour les redondances\",\"datePublished\":\"2012-11-01T00:00:00+00:00\",\"dateModified\":\"2022-04-11T10:34:53+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\"},\"wordCount\":488,\"publisher\":{\"@id\":\"https:\/\/dataladder.com\/fr\/#organization\"},\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\",\"url\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\",\"name\":\"Techniques d'\u00e9puration des donn\u00e9es pour les redondances - Data Ladder\",\"isPartOf\":{\"@id\":\"https:\/\/dataladder.com\/fr\/#website\"},\"datePublished\":\"2012-11-01T00:00:00+00:00\",\"dateModified\":\"2022-04-11T10:34:53+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/dataladder.com\/fr\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Techniques d&#8217;\u00e9puration des donn\u00e9es pour les redondances\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/dataladder.com\/fr\/#website\",\"url\":\"https:\/\/dataladder.com\/fr\/\",\"name\":\"Data Ladder\",\"description\":\"Enterprise Data Profiling, Cleansing, and Matching\",\"publisher\":{\"@id\":\"https:\/\/dataladder.com\/fr\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/dataladder.com\/fr\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/dataladder.com\/fr\/#organization\",\"name\":\"Data Ladder\",\"url\":\"https:\/\/dataladder.com\/fr\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/dataladder.com\/fr\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/dataladder.com\/wp-content\/uploads\/2018\/06\/DL-Logo-Ball-30.png\",\"contentUrl\":\"https:\/\/dataladder.com\/wp-content\/uploads\/2018\/06\/DL-Logo-Ball-30.png\",\"width\":413,\"height\":408,\"caption\":\"Data Ladder\"},\"image\":{\"@id\":\"https:\/\/dataladder.com\/fr\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.linkedin.com\/company\/dataladder-llc\/\",\"https:\/\/web.facebook.com\/DataLadderSoftware\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/dataladder.com\/fr\/#\/schema\/person\/6cc3d6b3c83c611546541b5eb2d1e21b\",\"name\":\"lbarrera\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/dataladder.com\/fr\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/5198cb4dd374e7d879a15a9cf20299b3?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/5198cb4dd374e7d879a15a9cf20299b3?s=96&d=mm&r=g\",\"caption\":\"lbarrera\"},\"url\":\"https:\/\/dataladder.com\/fr\/author\/lbarrera\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Techniques d'\u00e9puration des donn\u00e9es pour les redondances - Data Ladder","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/","og_locale":"fr_FR","og_type":"article","og_title":"Techniques d'\u00e9puration des donn\u00e9es pour les redondances - Data Ladder","og_description":"Le traitement des donn\u00e9es dupliqu\u00e9es n\u00e9cessite une strat\u00e9gie pour traiter les donn\u00e9es incoh\u00e9rentes. La premi\u00e8re \u00e9tape consisterait \u00e0 normaliser les adresses \u00e0 l&rsquo;aide d&rsquo;un logiciel de rapprochement des donn\u00e9es. Deuxi\u00e8mement, assurez-vous d&rsquo;utiliser des programmes de saisie de donn\u00e9es qui valident les formats des champs, afin d&rsquo;\u00e9viter les erreurs, comme la saisie de noms dans un [&hellip;]","og_url":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/","og_site_name":"Data Ladder","article_publisher":"https:\/\/web.facebook.com\/DataLadderSoftware","article_published_time":"2012-11-01T00:00:00+00:00","article_modified_time":"2022-04-11T10:34:53+00:00","author":"lbarrera","twitter_card":"summary_large_image","twitter_misc":{"\u00c9crit par":"lbarrera","Dur\u00e9e de lecture estim\u00e9e":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#article","isPartOf":{"@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/"},"author":{"name":"lbarrera","@id":"https:\/\/dataladder.com\/fr\/#\/schema\/person\/6cc3d6b3c83c611546541b5eb2d1e21b"},"headline":"Techniques d&rsquo;\u00e9puration des donn\u00e9es pour les redondances","datePublished":"2012-11-01T00:00:00+00:00","dateModified":"2022-04-11T10:34:53+00:00","mainEntityOfPage":{"@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/"},"wordCount":488,"publisher":{"@id":"https:\/\/dataladder.com\/fr\/#organization"},"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/","url":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/","name":"Techniques d'\u00e9puration des donn\u00e9es pour les redondances - Data Ladder","isPartOf":{"@id":"https:\/\/dataladder.com\/fr\/#website"},"datePublished":"2012-11-01T00:00:00+00:00","dateModified":"2022-04-11T10:34:53+00:00","breadcrumb":{"@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/dataladder.com\/fr\/techniques-depuration-des-donnees-pour-les-redondances\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dataladder.com\/fr\/"},{"@type":"ListItem","position":2,"name":"Techniques d&#8217;\u00e9puration des donn\u00e9es pour les redondances"}]},{"@type":"WebSite","@id":"https:\/\/dataladder.com\/fr\/#website","url":"https:\/\/dataladder.com\/fr\/","name":"Data Ladder","description":"Enterprise Data Profiling, Cleansing, and Matching","publisher":{"@id":"https:\/\/dataladder.com\/fr\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dataladder.com\/fr\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/dataladder.com\/fr\/#organization","name":"Data Ladder","url":"https:\/\/dataladder.com\/fr\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dataladder.com\/fr\/#\/schema\/logo\/image\/","url":"https:\/\/dataladder.com\/wp-content\/uploads\/2018\/06\/DL-Logo-Ball-30.png","contentUrl":"https:\/\/dataladder.com\/wp-content\/uploads\/2018\/06\/DL-Logo-Ball-30.png","width":413,"height":408,"caption":"Data Ladder"},"image":{"@id":"https:\/\/dataladder.com\/fr\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.linkedin.com\/company\/dataladder-llc\/","https:\/\/web.facebook.com\/DataLadderSoftware"]},{"@type":"Person","@id":"https:\/\/dataladder.com\/fr\/#\/schema\/person\/6cc3d6b3c83c611546541b5eb2d1e21b","name":"lbarrera","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dataladder.com\/fr\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/5198cb4dd374e7d879a15a9cf20299b3?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5198cb4dd374e7d879a15a9cf20299b3?s=96&d=mm&r=g","caption":"lbarrera"},"url":"https:\/\/dataladder.com\/fr\/author\/lbarrera\/"}]}},"modified_by":null,"_links":{"self":[{"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/posts\/63311"}],"collection":[{"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/comments?post=63311"}],"version-history":[{"count":1,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/posts\/63311\/revisions"}],"predecessor-version":[{"id":66834,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/posts\/63311\/revisions\/66834"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/media\/3221"}],"wp:attachment":[{"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/media?parent=63311"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/categories?post=63311"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dataladder.com\/fr\/wp-json\/wp\/v2\/tags?post=63311"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}