On sub-gaussian concentration of missing mass
    
Teoriâ veroâtnostej i ee primeneniâ, Vol. 68 (2023) no. 2, pp. 393-400
    
See the article record from the source Math-Net.Ru
            
Statistical inference on the missing mass aims to estimate the weight of elements not observed during sampling. Since the pioneering work of Good and Turing, the problem has been studied in many areas, including statistical linguistics, ecology, and machine learning.
Proving the sub-Gaussian behavior of the missing mass has been notoriously hard, and a number of complicated arguments have been proposed: logarithmic Sobolev inequalities, thermodynamic approaches, and information-theoretic transportation methods. Prior works have argued that the difficulty is inherent, and classical tools are inadequate.
We show that this common belief is false, and that all that is needed to establish the sub-Gaussian concentration is the classical inequality of Bernstein. The educational value of our work lies in demonstrating this inequality in its full generality, an aspect not well recognized by researchers.
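
To make the estimated quantity concrete: for a distribution with probabilities p_1, ..., p_k and a sample of size n, the missing mass is the sum of the p_i over symbols that never occur in the sample. The following is a minimal illustrative sketch, not taken from the article, that simulates the missing mass and compares it with the Good-Turing estimate (the fraction of the sample made up of symbols seen exactly once). The function names and the `simulate` helper are hypothetical and introduced only for illustration.

```python
import random
from collections import Counter


def missing_mass(probs, sample):
    """Total probability of the symbols that never appear in the sample."""
    seen = set(sample)
    return sum(p for i, p in enumerate(probs) if i not in seen)


def good_turing_estimate(sample):
    """Good-Turing estimate: number of symbols seen exactly once, divided by n."""
    counts = Counter(sample)
    singletons = sum(1 for c in counts.values() if c == 1)
    return singletons / len(sample)


def simulate(probs, n, trials, seed=0):
    """Draw `trials` i.i.d. samples of size n; return (true, estimated) pairs.

    Hypothetical helper for illustration; symbols are the indices 0..k-1.
    """
    rng = random.Random(seed)
    symbols = range(len(probs))
    results = []
    for _ in range(trials):
        sample = rng.choices(symbols, weights=probs, k=n)
        results.append((missing_mass(probs, sample), good_turing_estimate(sample)))
    return results


if __name__ == "__main__":
    k = 1000
    uniform = [1.0 / k] * k  # assumed toy distribution
    runs = simulate(uniform, n=500, trials=200)
    true_values = [t for t, _ in runs]
    mean = sum(true_values) / len(true_values)
    spread = max(true_values) - min(true_values)
    print(f"mean missing mass: {mean:.4f}, range over trials: {spread:.4f}")
```

Running the script on such a toy distribution shows the simulated missing mass clustering tightly around its mean across trials, which is the concentration behavior the abstract refers to; the simulation is only suggestive and does not verify any tail bound.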
			
Keywords: missing mass, measure concentration, heterogenic Bernstein's inequality, sub-Gamma concentration.
                    
@article{TVP_2023_68_2_a10,
     author = {M. Skorski},
     title = {On sub-gaussian concentration of missing mass},
     journal = {Teori\^a vero\^atnostej i ee primeneni\^a},
     pages = {393--400},
     publisher = {mathdoc},
     volume = {68},
     number = {2},
     year = {2023},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/TVP_2023_68_2_a10/}
}
                      
                      
M. Skorski. On sub-gaussian concentration of missing mass. Teoriâ veroâtnostej i ee primeneniâ, Vol. 68 (2023) no. 2, pp. 393-400. http://geodesic.mathdoc.fr/item/TVP_2023_68_2_a10/
