{"id":669,"date":"2019-12-24T09:21:56","date_gmt":"2019-12-24T08:21:56","guid":{"rendered":"http:\/\/pepason.fr\/?p=669"},"modified":"2020-09-22T17:17:14","modified_gmt":"2020-09-22T15:17:14","slug":"deeplearning-et-soundscapes-par-q-a","status":"publish","type":"post","link":"https:\/\/pepason.fr\/?p=669","title":{"rendered":"&#8220;Deeplearning&#8221; et &#8220;Soundscapes&#8221; par Q.A"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"669\" class=\"elementor elementor-669\">\n\t\t\t\t\t\t<div class=\"elementor-inner\">\n\t\t\t\t<div class=\"elementor-section-wrap\">\n\t\t\t\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-66ccd1e elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"66ccd1e\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t\t\t<div class=\"elementor-row\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-71a7de4\" data-id=\"71a7de4\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-column-wrap elementor-element-populated\">\n\t\t\t\t\t\t\t<div class=\"elementor-widget-wrap\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-1050c00 elementor-widget elementor-widget-heading\" data-id=\"1050c00\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\">Cr\u00e9er des paysages sonore... Un projet d'informatique de Quentin Aristote<\/h1>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-83ab47c elementor-widget elementor-widget-text-editor\" data-id=\"83ab47c\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"elementor-text-editor elementor-clearfix\">\n\t\t\t\t<p><em>Comment d\u00e9finir LE paysage sonore d\u2019un espace ? En effet, comment d\u00e9finir ce qui change en permanence ?\u00a0 Comment le reproduire avec suffisamment de pr\u00e9cision pour que notre oreille le reconnaisse et dise : &#8220;Ah oui, ce son vient de l\u00e0 !&#8221; ?&#8230;<\/em><\/p><p>Pour r\u00e9pondre \u00e0 cette question, Quentin Aristote, \u00e9l\u00e8ve en Master 1 d&#8217;Informatique \u00e0 l&#8217;Ecole Normale Sup\u00e9rieure s&#8217;associe \u00e0 PePaSon et propose de coder un algorithme de &#8220;<a href=\"https:\/\/fr.wikipedia.org\/wiki\/Apprentissage_profond\">deeplearning<\/a>&#8221; pour\u00a0<strong>apprendre<\/strong> \u00e0 partir d&#8217;enregistrements r\u00e9elles \u00e0 <strong>reconstruire<\/strong> des paysages sonores&#8230; irr\u00e9els mais similaires.\u00a0<\/p>\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-13cca24 elementor-align-left elementor-widget elementor-widget-button\" data-id=\"13cca24\" data-element_type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t<a href=\"https:\/\/git.eleves.ens.fr\/qaristote\/soundscapes\" class=\"elementor-button-link elementor-button elementor-size-sm\" role=\"button\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t<span class=\"elementor-button-icon elementor-align-icon-left\">\n\t\t\t\t<i aria-hidden=\"true\" class=\"fab fa-github\"><\/i>\t\t\t<\/span>\n\t\t\t\t\t\t<span class=\"elementor-button-text\">T\u00e9l\u00e9charger le code \"open-source\" sur GitHub<\/span>\n\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-ce8d995 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"ce8d995\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t\t\t<div class=\"elementor-row\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-4f8b29d\" data-id=\"4f8b29d\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-column-wrap elementor-element-populated\">\n\t\t\t\t\t\t\t<div class=\"elementor-widget-wrap\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-0ebbba6 elementor-widget elementor-widget-image\" data-id=\"0ebbba6\" data-element_type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"elementor-image\">\n\t\t\t\t\t\t\t\t\t<figure class=\"wp-caption\">\n\t\t\t\t\t\t\t\t\t\t<img width=\"800\" height=\"283\" src=\"https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-1024x362.png\" class=\"attachment-large size-large\" alt=\"\" loading=\"lazy\" srcset=\"https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-1024x362.png 1024w, https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-300x106.png 300w, https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-768x272.png 768w, https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-1536x543.png 1536w, https:\/\/pepason.fr\/wp-content\/uploads\/2020\/09\/Spectro_Quentin_low-2048x724.png 2048w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/>\t\t\t\t\t\t\t\t\t\t\t<figcaption class=\"widget-image-caption wp-caption-text\">Extrait du spectrogramme produit par l'algorithme \u00e0 la fin du projet. Ca ne marche pas encore... Mais c'est une belle tentative loin d'\u00eatre al\u00e9atoire !<\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-77215be\" data-id=\"77215be\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-column-wrap elementor-element-populated\">\n\t\t\t\t\t\t\t<div class=\"elementor-widget-wrap\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-cb2bdfc elementor-widget elementor-widget-text-editor\" data-id=\"cb2bdfc\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"elementor-text-editor elementor-clearfix\">\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-8c6788b elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"8c6788b\" data-element_type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-97d0e63 elementor-widget elementor-widget-heading\" data-id=\"97d0e63\" data-element_type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\">Description du projet (Q.A) - Abstract<\/h4>\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4209201 elementor-widget elementor-widget-text-editor\" data-id=\"4209201\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"elementor-text-editor elementor-clearfix\">\n\t\t\t\t<div class=\"et_pb_module et_pb_text et_pb_text_2 et_pb_bg_layout_light et_pb_text_align_left\">\n<div class=\"et_pb_text_inner\">\n<h6><strong>Version fran\u00e7aise :<\/strong><\/h6>\n<p>Le but du projet est d\u2019\u00e9crire un code informatique capable de g\u00e9n\u00e9rer des&nbsp;<em>paysages sonores<\/em>\u2026. Une mani\u00e8re de la faire est de faire appel \u00e0 une forme particuli\u00e8re d\u2019algorithme appel\u00e9e \u201c<a href=\"https:\/\/medium.com\/@davidroman0O\/r%C3%A9seaux-neuraux-pour-les-nuls-a7f5f63b1c10\">r\u00e9seau de neurone<\/a>\u201c. Ce dernier est capable, apr\u00e8s avoir travaill\u00e9 \u00e0 partir d\u2019une base de donn\u00e9es (<em>i<\/em><em>.e.&nbsp;<\/em>de paysages sonores pr\u00e9-existants),&nbsp;d\u2019apprendre&nbsp;\u201ccomment\u201d est fait un paysage sonore et, une fois bien entrain\u00e9e, devrait pouvoir produire des sons similaires \u00e0 de v\u00e9ritables paysages sonores, totalement nouveaux.<\/p>\n<p>Il existe de nombreuses mani\u00e8res de faire ceci\u2026 De nombreux r\u00e9seaux de neurones existents d\u00e9j\u00e0, travaillant avec des fichiers sonores et une multitude de bases de donn\u00e9es sont disponibles\u2026. La question reste \u00e0 savoir :<em>&nbsp;\u201d Quel type de paysage sonore veut-on g\u00e9n\u00e9rer ? Sur quelle dur\u00e9e et admettant quelles variations ? \u201c.<\/em>&nbsp;Ces param\u00e8tres sont d\u00e9terminants dans le choix de la m\u00e9thode \u00e0 utiliser.<\/p>\n<p>Pour commencer nous travaillerons \u00e0 partir de la riche&nbsp;<a href=\"https:\/\/sonotheque.mnhn.fr\/\">sonoth\u00e8que<\/a>&nbsp;du Mus\u00e9um National d\u2019Histoire Naturelle de Paris (France) sur des enregistrements de 30s&nbsp;<em>libres de droits<\/em>. Nous utiliserons un \u201c<a href=\"https:\/\/en.wikipedia.org\/wiki\/Autoencoder\"><em>Auto-Encoder<\/em><\/a>\u201d,&nbsp;<em>i.e.<\/em>&nbsp;un reseau de neurone compose de deux sous-r\u00e9seaux, l\u2019un travaillant \u00e0 extraire les informations les plus simples possible de chaque paysage sonore (enregistrement) sous forme d\u2019un court vecteur, l\u2019autre travaillant \u00e0 recomposer un paysage sonore \u00e0 partir de la multitude de petits vecteurs extraits par le premier.<\/p>\n<p>Th\u00e9oriquement nous pourrions ainsi obtenir un paysage sonore \u201cressemblant\u201d \u00e0 ceux sur lesquels s\u2019est entrain\u00e9 l\u2019ordinateur.<\/p>\n<p>Nous n\u2019avons n\u00e9anmoins qu\u2019un faible contr\u00f4le sur le produit fini de l\u2019algorithme et l\u2019ajout de contraintes se fera dans un second temps. Cela reste th\u00e9oriquement complexe et n\u00e9cessite des informations tierces parfois indisponibles (commentaires, meta-donn\u00e9es\u2026).<\/p>\n<p>Un travail futur sera d\u2019appliquer ce r\u00e9seau \u00e0 un environnement unique afin d\u2019en extraire et g\u00e9n\u00e9rer les principaux aspects. Nous pourrons aussi travailler avec des donn\u00e9es labelis\u00e9es et comment\u00e9es afin d\u2019extraire et de produire des informations qualitatives sur les paysages sonores.<\/p>\n<p>Mais gardons en t\u00eate\u2026 Les r\u00e9seaux de neurones sont encore mal compris et \u00e9chappent aux math\u00e9matiques contemporains les rendant parfois difficiles \u00e0 utiliser\u2026<\/p><p><br><\/p>\n<\/div>\n<\/div>\n<div class=\"et_pb_module et_pb_text et_pb_text_3 et_pb_bg_layout_light et_pb_text_align_left\">\n<div class=\"et_pb_text_inner\">\n<h6><em><strong>English version :<\/strong><\/em><\/h6>\n<p><em>The goal of the project is to design a neural network whose aim is to generate soundscapes from scratch : although we would need a database of pre-existing soundscapes to train the neural network, once it is fully trained it should be able to generate new random soundscapes that sound like real soundscapes.<\/em><\/p>\n<p><em>They are a lot of ways to do this. Many neural network architectures working well with sound recordings currently exist and have already been extensively studied. Similarly, there are many ways to choose the training database : it could consist in very different soundscapes as well as in many soundscapes recorded in the same environment, for example. Ultimately, all these choices need to be made with what type of soundscapes we want to generate. Do we want them to be diverse or variations on the same soundscape ? Do we want them to last a few seconds or a few hours ?<\/em><\/p>\n<p><em>For starters, I will be working with the Paris Natural History Museum soundbank, that provides open-access 30 seconds samples of many different soundscapes. I will thus try to design an Auto Encoder that is able to generate 30 seconds long soundscapes that may be very diverse. An Auto Encoder is a special kind of neural network that is actually composed of two sub-networks. The first one, the Encoder, tries to reduce the dimension of the data, i.e. extract tje main features, and thus outputs a small-dimensional vector for each soundscape. The second one, the Decoder, takes those small dimensional vectors and tries to reconstruct the original soundscapes from them. Once this network is trained, we can try to generate new soundscapes by taking a random vector and feeding it to the Decoder, hoping that the small-dimensional vector space in between the Encoder and the Decoder is a faithful representation of the space of soundscapes.<\/em><\/p>\n<p><em>In practice, this means that we don\u2019t get to choose what kind of soundscape the Decoder will generate, but by changing the Auto Encoder\u2019s architecture a little bit it is theoretically possible to add a feature selector. Of course, as long as we don\u2019t have any metadata on our recordings, it won\u2019t be possible to choose which features this feature selector would allow us to select.<\/em><\/p>\n<p><em>Once this first network works well, it should be possible to train it with other kind of soundbanks. For example, we\u2019d like to focus on soundcapes recorded in a single environment. It should also be possible to work with pre-labeled data (if we get some) to control the kind of soundscapes we generate and increase the training speed.<\/em><\/p>\n<p><em>All of this is of course only theoretical yet, and we have no guarantee it should work except the fact that it did in the past for other kinds of data : neural networks aren\u2019t very well understood mathematically.<\/em><\/p>\n<\/div>\n<\/div>\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Quentin Aristote s&#8217;est associ\u00e9 \u00e0 PePaSon pour son projet de Master en Informatique. L&#8217;objectif : coder un programme capable de cr\u00e9er ab initio un paysage sonore aux propri\u00e9t\u00e9s similaires \u00e0 celles d&#8217;un environnement donn\u00e9 et pr\u00e9-enregistr\u00e9. <\/p>\n","protected":false},"author":1,"featured_media":671,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"cybocfi_hide_featured_image":""},"categories":[6,5],"tags":[],"post_mailing_queue_ids":[],"_links":{"self":[{"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/posts\/669"}],"collection":[{"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/pepason.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=669"}],"version-history":[{"count":6,"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/posts\/669\/revisions"}],"predecessor-version":[{"id":699,"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/posts\/669\/revisions\/699"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/pepason.fr\/index.php?rest_route=\/wp\/v2\/media\/671"}],"wp:attachment":[{"href":"https:\/\/pepason.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=669"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/pepason.fr\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=669"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/pepason.fr\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=669"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}