Statistical Estimators Using Jointly Administrative and Survey Data to Produce French Structural Business Statistics

Brion, Philippe; Gros, Emmanuel
December 2015
Journal of Official Statistics (JOS);Dec2015, Vol. 31 Issue 4, p589
Academic Journal
Using as much administrative data as possible is a general trend among national statistical institutes. Administrative sources of various kinds, from tax authorities or other administrative bodies, are very helpful material in the production of business statistics. However, these sources often have to be supplemented with information collected through statistical surveys. This article describes how Insee has implemented such a strategy to produce French structural business statistics. The originality of the French procedure is that administrative and survey variables are used jointly for the same enterprises, unlike the majority of multisource systems, in which the two kinds of sources generally complement each other for different categories of units. The idea is to exploit, as far as possible, the richness of the administrative sources combined with the timeliness of a survey, even though the survey covers only a sample of enterprises. One main issue is the classification of enterprises within the NACE nomenclature, a cornerstone variable for breaking the results down by industry. At a given date, two values of the corresponding code may coexist: the register value, which is available for every enterprise but is not necessarily up to date, and the value derived from the survey data, which is available only for the sampled enterprises. Using all this information together requires specific statistical estimators that combine some properties of difference estimators with calibration techniques. This article presents these estimators and their statistical properties, and compares them with those of other methods.
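
To make the idea of a difference estimator concrete, here is a minimal sketch in Python. It is a generic textbook-style difference estimator of totals by industry, not the estimator developed in the article, and it leaves out the calibration step entirely. The function name, the data layout, and the toy figures are all assumptions made for illustration: every enterprise has an administrative value and a register code, and sampled enterprises also carry a survey code and a sampling weight.

```python
from collections import defaultdict


def difference_estimates(population, sample):
    """Estimate totals by industry with a generic difference estimator.

    population: dict unit_id -> (register_code, admin_value), all enterprises
    sample:     dict unit_id -> (survey_code, weight), sampled enterprises only
    (Hypothetical layout, chosen for this illustration.)
    """
    totals = defaultdict(float)

    # Step 1: exhaustive totals based on the register code of every unit.
    for register_code, admin_value in population.values():
        totals[register_code] += admin_value

    # Step 2: weighted correction from the sample. Each sampled unit adds its
    # weighted value under its survey code and subtracts it under its
    # register code, so the correction is zero when the two codes agree.
    for unit_id, (survey_code, weight) in sample.items():
        register_code, admin_value = population[unit_id]
        totals[survey_code] += weight * admin_value
        totals[register_code] -= weight * admin_value

    return dict(totals)


# Toy data (invented): four enterprises, two sampled with weight 2.
population = {
    "A": ("10", 100.0),
    "B": ("10", 50.0),
    "C": ("20", 30.0),
    "D": ("20", 80.0),
}
sample = {
    "A": ("10", 2.0),  # survey confirms the register code
    "C": ("10", 2.0),  # survey reclassifies the unit from "20" to "10"
}

estimates = difference_estimates(population, sample)
```

In this toy case the corrections cancel across industries, so the estimated industry totals still sum to the exhaustive administrative total; only the breakdown by industry changes.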


Related Articles

  • Editorial. Heiser, Willem // Journal of Classification;Jul2012, Vol. 29 Issue 2, p117 

    An introduction is presented in which the editor discusses various articles within the issue on topics including a new approach to hierarchical clustering, general-purpose indices and missing data imputation and data fusion.

  • A Fast Matching Method for Fingerprints with Less Than Two Global Singular Points. Wu Xiaobing; Liu Shuyan; Li Kejie; Sharma, Dharmendra // International MultiConference of Engineers & Computer Scientists;2007, p89 

    A fast local pattern matching method for fingerprints with less than two singulars is proposed in this paper. The method has three steps: pre-processing, minutia extraction, and matching based on local pattern. In pre-processing, a twice smoothing method is used to get a high qualification...

  • Improved analysis of orthogonal matching pursuit in general perturbations. Xiaobo Zhang; Wenbo Xu; Yun Tian; Jiaru Lin // Electronics Letters;3/8/2018, Vol. 54 Issue 5, p295 

    Previous research on orthogonal matching pursuit (OMP) algorithm mainly focuses on the recovery performance of a sparse signal x given an acquired model y = Fx + n. A general perturbation model y = (F + E)x + n in addition to the above acquired model exists, where E is the measurement...

  • Imputation.  // Retail Trade;Jun2008, Vol. 80 Issue 6, p76 

    The article offers information on the process of data imputation of the Monthly Retail Trade Survey (MRTS) in Canada. This is done by assigning values when they are missing on the record being tested to ensure that estimates are of high quality and internal consistency is created. It has three...

  • Maximum Consistency of Incomplete Data via Non-Invasive Imputation. Gediga, Günther; Düntsch, Ivo // Artificial Intelligence Review;Mar2003, Vol. 19 Issue 1, p93 

    Presents an algorithm to impute missing values from given data alone. Performance of the algorithm; Use of a non-numeric rule based data analysis; Maximization of consistency of imputation from known values; Contrast to the prevailing statistical imputation algorithms; Suitability for a wide...

  • Traffic Speed Data Imputation Method Based on Tensor Completion. Ran, Bin; Tan, Huachun; Feng, Jianshuai; Liu, Ying; Wang, Wuhong // Computational Intelligence & Neuroscience;3/3/2015, Vol. 2015, p1 

    Traffic speed data plays a key role in Intelligent Transportation Systems (ITS); however, missing traffic data would affect the performance of ITS as well as Advanced Traveler Information Systems (ATIS). In this paper, we handle this issue by a novel tensor-based imputation approach....

  • Multiple imputation of covariates by substantive-model compatible fully conditional specification. Bartlett, Jonathan W.; Morris, Tim P. // Stata Journal;2015, Vol. 15 Issue 2, p437 

    Multiple imputation is a practical, principled approach to handling missing data. When used to impute missing values in covariates of regression models, imputation models may be misspecified if they are not compatible with the substantive model of interest for the outcome. In this article, we...

  • Missing Data Imputation of Solar Radiation Data under Different Atmospheric Conditions. Crespo Turrado, Concepción; Meizoso López, María del Carmen; Sánchez Lasheras, Fernando; Rodríguez Gómez, Benigno Antonio; Calvo Rollé, José Luis; Javier de Cos Juez, Francisco // Sensors (14248220);2014, Vol. 14 Issue 11, p20382 

    Global solar broadband irradiance on a planar surface is measured at weather stations by pyranometers. In the case of the present research, solar radiation values from nine meteorological stations of the MeteoGalicia real-time observational network, captured and stored every ten minutes, are...

  • Expanding HadISD: quality-controlled, sub-daily station data from 1931. Dunn, Robert J. H.; Willett, Kate M.; Parker, David E.; Mitchell, Lorna // Geoscientific Instrumentation, Methods & Data Systems (GI);2016, Vol. 5 Issue 2, p473 

    HadISD is a sub-daily, station-based, quality-controlled dataset designed to study past extremes of temperature, pressure and humidity and allow comparisons to future projections. Herein we describe the first major update to the HadISD dataset. The temporal coverage of the dataset has been...

