Use of Uncertain External Information in Statistical Estimation
by Sergey Tarima 1,* , Zhanna Zenkova 2
1 Medical College of Wisconsin, Division of Biostatistics, Institute for Health and Equity, Wauwatosa, Wisconsin, 53226, USA
2 National Research Tomsk State University, Institute of Applied Mathematics and Computer Science, Tomsk, 634002, Russia
* Author to whom correspondence should be addressed.
Journal of Engineering Research and Sciences, Volume 1, Issue 8, Page # 12-18, 2022; DOI: 10.55708/js0108002
Keywords: Additional information, Minimum variance, Minimum mean squared error, Statistical estimation
Received: 20 January 2022, Revised: 03 March 2022, Accepted: 26 July 2022, Published Online: 19 August 2022
APA Style
Tarima, S., & Zenkova, Z. (2022). Use of Uncertain External Information in Statistical Estimation. Journal of Engineering Research and Sciences, 1(8), 12–18. https://doi.org/10.55708/js0108002
Chicago/Turabian Style
Tarima, Sergey, and Zhanna Zenkova. “Use of Uncertain External Information in Statistical Estimation.” Journal of Engineering Research and Sciences 1, no. 8 (August 1, 2022): 12–18. https://doi.org/10.55708/js0108002.
IEEE Style
S. Tarima and Z. Zenkova, “Use of Uncertain External Information in Statistical Estimation,” Journal of Engineering Research and Sciences, vol. 1, no. 8, pp. 12–18, Aug. 2022, doi: 10.55708/js0108002.
A product’s life cycle hinges on its sales. Product sales are determined by a combination of market demand, industrial production, logistics, supply chains, labor hours, and countless other factors. Business-specific questions about sales are often formalized into questions relating to specific quantities in sales data. Statistical estimation of these quantities of interest is crucial but restricted availability of empirical data reduces the accuracy of such estimation. For example, under certain regularity conditions the variance of maximum likelihood estimators cannot be asymptotically lower than the Cramer-Rao lower bound. The presence of additional information from external sources therefore allows the improvement of statistical estimation. Two types of additional information are considered in this work: unbiased and possibly biased. In order to incorporate these two types of additional information in statistical estimation, this manuscript minimizes mean squared error and variance. Publicly available Walmart sales data from 45 stores across 2010-2012 is used to illustrate how these statistical methods can be applied to use additional information for estimating weekly sales. The holiday effect (sales spikes during holiday weeks) adjusted for overtime trends is estimated with the use of relevant external information.
- R. H. Hayes, “Statistical estimation problems in inventory control”, Manag. Sci., vol. 15, no. 11, pp. 686–701, 1969.
- Z.-P. Fan, Y.-J. Che, Z.-Y. Chen, “Product sales forecasting using online reviews and historical sales data: A method combining the bass model and sentiment analysis”, Journal of Business Research, vol. 74, pp. 90–100, 2017.
- L. H. Liyanagea, J. Shanthikumar, “Apractical inventory control policy using operational statistics”, Oper. Res. Lett., , no. 33, pp. 341–348, 2005.
- S. Tarima, D. Pavlov, “Using auxiliary information in statistical func- tion estimation”, ESAIM: Probab. Stat., vol. 10, pp. 11–23, 2006.
- S. Tarima, S. Slavova, T. Fritsch, L. Hall, “Probability estimation when some observations are grouped”, Stat. Med., vol. 26, no. 8, pp. 1745–1761, 2007.
- tic z and chi-squared tests with auxiliary ation”, Metrika, vol. XX, pp. xx–xx, 2022.
- S. Tarima, K. Patel, R. Sparapani, M. O’Brien, L. Cassidy, J. Meurer, “Use of previously published data in statistical estimation”, “Interna- tional Conference on Risk Analysis”, pp. 78–88, Springer, 2022.
- S. Tarima, Y. Dmitriev, “Statistical estimation with possibly incorrect model assumptions”, Bul. Tomsk St. University: cont., comput., inf., vol. 8, pp. 78–99, 2009.
- S. Tarima, A. Vexler, S. Singh, “Robust mean estimation under a possibly incorrect log-normality assumption”, Commun. Stat.–Simul. C., vol. 42, no. 2, pp. 316–326, 2013.
- Y. Dmitriev, P. Tarassenko, Y. Ustinov, “On estimation of linear func- tional by utilizing a prior guess”, A. Dudin, A. Nazarov, R. Yakupov, Gortsev, eds., “Information Technologies and Mathematical Mod- elling”, pp. 82–90, Springer International Publishing, Cham, 2014.
- Y. Dmitriev, P. Tarassenko, “On adaptive estimation using a prior guess”, “Applied methods of statistical analysis. Nonparametric ap- proach – AMSA2015”, pp. 49–55, Novosibirsk, September, 2015.
- Y. Dmitriev, P. Tarassenko, F. Tarassenko, “On improving statistical estimation by utilizing collateral information (guesses): a case of the probability estimation”, “International Workshop on Applied Methods of Statistical Analysis: Nonparametric methods in cybernetics and system analysis”, pp. 262–269, Krasnoyarsk, September, 2017.
- Y. Dmitriev, G. Koshkin, V. Lukov, “Combined identification and prediction algorithms”, “IV International Research Conference: In- formation Technologies in Science, Management, Social Sphere and Medicine”, pp. 244–247, Tomsk, December, 2017.
- Z. Zenkova, E. Krainova, “Estimating the net premium using ad- ditional information about a quantile of the cumulative distribu- tion function”, Bus. Inform., vol. 42, no. 4, pp. 55–63, 2017, doi: 10.17323/1998-0663.2017.4.55.63.
- S. Tarima, Z. Zenkova, “Use of uncertain additional information in newsvendor models”, “2020 5th International Conference on Logistics Operations Management (GOL)”, pp. 1–6, IEEE, 2020.
- R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2021.
- N. Stojanović, M. Soldatović, M. Milićević, “Walmart recruiting–store sales forecasting”, “Proceedings of the XIV International Symposium Symorg”, p. 135, 2014.
- M. Singh, B. Ghutla, R. Lilo Jnr, A. F. S. Mohammed, M. A. Rashid, “Walmart’s sales data analysis – a big data analytics per- spective”, “2017 4th Asia-Pacific World Congress on Computer Sci- ence and Engineering (APWC on CSE)”, pp. 114–119, 2017, doi: 10.1109/APWConCSE.2017.00028.
- C. Catal, E. Kaan, B. Arslan, A. Akbulut, “Benchmarking of regression algorithms and time series analysis techniques for sales forecasting”, Balkan Journal of Electrical and Computer Engineering, vol. 7, no. 1, pp. 20–26, 2019.
- S. Tarima, B. Tuyishimire, R. Sparapani, L. Rein, J. Meurer, “Estima- tion combining unbiased and possibly biased estimators”, Journal of Statistical Theory and Practice, vol. 14, no. 2, pp. 1–20, 2020.
- J. P. Higgins, J. Thomas, J. Chandler, M. Cumpston, T. Li, M. J. Page,
V. A. Welch, “Cochrane handbook for systematic reviews of interven- tions version 6.2 (updated february 2021)”, https://www.training. cochrane.org/handbook. - S. Calderazzo, S. Tarima, C. Reid, N. Flournoy, T. Friede, N. Geller, J. L. Rosenberger, N. Stallard, M. Ursino, M. Vandemeulebroecke, K. Van Lancker, S. Zohar, “Coping with information loss and the use of auxiliary sources of data: A report from the niss ingram es on unplanned clinical trial disruptions”, 2022, 10.48550/ARXIV.2206.11238.