Support for incident management in optical networks through critical points identification
Soporte de gestión de incidentes de redes ópticas a través de la identificación de puntos críticos
DOI:
https://doi.org/10.15446/ing.investig.v39n1.71346Keywords:
Fault Management, Optical Networks, Service Management, Risk Management, Business-driven IT Management. (en)Gestión de fallas, Redes ópticas, Administración de servicios, Gestión de riesgos, Gestión de TI impulsada por los negocios. (es)
In incident management for optical networks, when a fault or event occurs, a network element will often send a notification, in the form of an “alarm”, to operators and managers. Alarms contain valuable information to support the fault management process at the operating level, because it is a persistent indication of a fault. Alarms usually clear only with the triggering of the solution for its cause. To mitigate business risks related to faults in optical networks, service managers need to estimate the impact of a network fault in relation to business needs. In optical networks, identifying redundancy points of high-impact on the network is still a challenge for managers. They often rely on their own experience to prioritize points that possibly need to have redundancy in those networks. This work presents a simulation model capable of locating suitable points for the application of asset redundancy to reduce optical network disruptions, based on business risk. The model is implemented in a software tool and then used in a case study of two reference networks, whose elements may fail according to a realistic failure scenario. Results of the study allow face validity with preliminary evidence that the model is useful to support incident management in optical networks.
En la gestión de incidentes de redes ópticas, cuando ocurre una falla o evento, generalmente un componente de la red envía una notificación a los operadores y gerentes. Las alarmas contienen información importante para respaldar el proceso de gestión de fallas a nivel operativo, porque es una indicación persistente de una falla que se borra solo con la solución de condición de disparo. Para mitigar los riesgos comerciales relacionados con las fallas en las redes ópticas, los administradores de servicios deben estimar el impacto de una falla de red en relación con las necesidades del negocio. En las redes ópticas, la identificación de puntos de redundancia de alto impacto de red todavía es un desafío para los gerentes. Usualmente, estos gerentes confían en su propia experiencia para priorizar puntos que posiblemente deban tener redundancia en esas redes. En este trabajo, presentamos una evaluación de un modelo de simulación, capaz de localizar puntos adecuados para la aplicación de redundancia de activos y así reducir las interrupciones de la red óptica, en función del riesgo comercial. El modelo se implementó en una herramienta de software y se procedió a un estudio de caso que incluye dos escenarios de simulación de redes de referencia, con resultados prometedores.
References
Benhcine, T., Elbiaze, H. and Idoudi, K. (2013). Fast Reroutebased network resiliency experimental investigations. Paper presented at the 15th International Conference on Transparent Optical Networks (ICTON), Cartagena SP, IEEE, Universidad Politecnica de Cartagena. DOI: 10.1109/ICTON.2013.6603065
Bloomfield, R.E., Popov, P., Salako, K., Stankovic, V. and Wright, D. (2017). Preliminary Interdependency Analysis: An Approach to Support Critical-Infrastructure RiskAssessment. Reliability Engineering and System Safety, 167, 198-217. DOI: 10.1016/j.ress.2017.05.030
Casella, G. and Berger, R.L. (2002). Statistical Inference (2nd Ed.). California: Duxbury Advanced Series.
Da Silva, C. and Fagotto, E. A. M. (2014). Diagnóstico e tratamento de incidentes na rede de computadores. Paper presented at the 11th International Conference on Information Systems and Technology Management, Shangai, CONTECSI.
Fabre, E., Benveniste, A., Haar, S., Jard, C. and Aghasaryan, A. (2004). Algorithms for Distributed Fault Management in Telecommunications Networks. Paper presented at the 11th International Conference on Telecommunicatons (ICT’2004), Fortaleza, BR, IEEE. DOI: 10.1007/978-3-540-27824-5_108
Fen, Z., Yanqin, Z. and Li, C. (2016). Research on Metro Intelligent Optical Network Planning and Optimization. Paper presented at the15th International Conference on Optical Communications and Networks (ICOCN), Hangzhou, IEEE Photonics Society. DOI: 10.1109/ICOCN.2016.7875735
Gómez, J.M., Mora, M., Gewald, H., Nebel, W. and O’Connor, R.V. (2017). Engineering and Management of Data Centers: An IT Service Management Approach. New York: Springer International Publishing AG. DOI: 10.1007/978-3-319-65082-1
Hanemann, A., Sailer, M. and Schmitz,D. (2004). Assured Service Quality by Improved Fault Management, Paper presented at the 2nd international conference on Service oriented computing - ICSOC ’04, New York, ACM SIGSOFT, ACM SIGWEB, and University of Trento. DOI: 10.1145/1035167.1035194
Homma, M. and Shinomiya, N. (2016). Finding Tie-sets with the Minimal Number of Total Elements for Effective Failure Recovery. Paper presented at the 7th International Conference on Computing Communication and Networking Technologies, Dallas TX, IEEE. DOI: 10.1145/2967878.2967888
ITU-T. (1995). Rec. M3100. Generic Network Information Model. Geneva: International Telecommunication Union, Telecommunication Standardization Sector. Retrieved from: https://www.itu.int/rec/dologin.asp?lang=e&id=TREC-M.Imp3100-200008-S!!MSW-E&type=items
Lehr, G., Dassow, H., Zeffler, P., Gladisch, A. and Hanik, N. (1998). Management of All-Optical WDM Networks: First results of European research project MOON. Paper presented at the NOMS 98 -1998 IEEE Network Operations and Management Symposium, New Orleans, LA, IEEE. DOI: 10.1109/NOMS.1998.655229
Li-Xia, L. and Yue-Jin, Z. (2014). Design of a New EPON Connection Automatic Protection System, Paper presented at Ninth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, Guangdong, China, IEEE. DOI: 10.1109/3PGCIC.2014.121
Lu, K., Yahyapoura, R., Wieder, P., Yaquba, E., Abdullah, M., Schloer, B. and Kotsokalis, C. (2016). Fault-tolerant Service Level Agreement lifecycle management in clouds using actor system, Future Generation Computer Systems 54, 247–259. DOI: 10.1016/j.future.2015.03.016
Maltz, A., Yuan, L., Zhang, M., Wu, X., Turner, D.J. and Chen, C. (2015). Automated datacenter network failure mitigation, U.S. Patent No. 9,025,434. Washington, D C.: U.S. Patent and Trademark Office.
Mas, C. and Thiran, P. (2000). An Efficient Algorithm for Locating Soft and Hard Failures in WDM Networks. IEEE Journal on Selected Areas in Communications, 18 (10), 1900-1911. http://dx.doi.org/10.1109/49.887911
Mas, C., Krauß, S. and Casier, K. (2011). Fault Management and Service Provisioning Process Model of Next Generation Access Networks. Paper presented at the 7th International Conference on Network and Service Management (CNSM), Paris, IEEE
Mas, C., Thiran, P. and Le Boudec. J.Y. (1999). Fault location at the WDM Layer. Photonic Network Communication, 1 (3), 235-255. DOI: 10.1023/A:1010063713383
Mas., C., Tomkos, I. and Tomguz, O.K. (2005). Failure location algorithm for transparent optical networks. IEEE Journal on Selected Areas in Communications, 23 (8), 1508- 1519. DOI: 10.1109/JSAC.2005.852182
Meira, M. and Nogueira, J. M. S. (2000). A Recursive Approach for Alarm Correlation in Telecommunication Networks. Paper presented at the IFIP/IEEE Network Operations and Management Symposium (NOMS 2000), Honolulu, IEEE. DOI: 10.1109/NOMS.2000.830469
OGC - Office of Government Commerce. (2007). ITIL v3 (Information Technology Infrastructure Library). London: TSO.
Oliveira, E.W., Brito, A. E. M. and Brasileiro, F.V. (2003). Projeto e Implementaçao de um Serviço de Detecçao de Falhas Perfeito. Paper presented at the XXI Simpósio Brasileiro de Redes de Computadores, Natal, RN, Sociedade Brasileira de Computaçao and Laboratório Nacional de Redes de Computadores. Retrieved from: http://ce-resd.facom.ufms.br/sbrc/2003/044.pdf
Ramaswami, R and Sivarajan, K. N. (2001). Optical Networks – A Practical Perspective (2nd Ed.). San Francisco CA: Morgan Kaufmann.
Rodríguez-García, A., Ramírez-López, L. and Travieso-Torres, J.C. (2015). New heuristic algorithm for dynamic traffic in WDM optical networks. Ingeniería e Investigación, 35 (3), 100-106. DOI: 10.15446/ing.investig.v35n3.51676
Rowe, G. and Wright, G. (1997). The Delphi technique as a forecasting tool: issues and analysis, International Journal of Forecasting, 15 (4), 353–375. DOI: 10.1016/S0169-2070(99)00018-7
Runerson, P. and Host, M. (2009). Guidelines for conducting and reporting case study research in software engineering. Empirical Software Engineering, 14,131-164. DOI: 10.1007/s10664-008-9102-8
Sergio, M.C., De Souza, J.A. and Gon¸calves, A.L. (2017). Idea Identification Model to Support Decision Making. IEEE Latin America Transactions, 15 (5), 968 - 973. DOI: 10.1109/TLA.2017.7912594
Smith, P., Hutchison, D., Sterbenz, J.P.G., Scholler, M., Fessi, A., Karaliopoulos, M., Lac, C., Plattner, B. (2011). Network resilience: a systematic approach. IEEE Communications Magazine, 49(7), 88-97. DOI: 10.1109/MCOM.2011.5936160
Sousa, B., Delfino, C., De Sousa, JN. and Everardo J. (2005). An Algorithm for Fault Location, in SDH/WDM Networks. Paper presented at the 12th International Conference on Telecommunicatons (ICT’2005), Institute of Electrical and Electronics Engineers
Specialski, S. (2018). Gerencia de Redes de Computadores e de Telecomunica¸coes, white paper. Florianópolis: Universi- ˜ dade de Santa Catarina. Retrieved from: http://cassio.orgfree.com/disciplinas/gredes/ApostilaGerenciamento.pdf.
Stein, K.U. (1999). Redundancy-optimized communication network for the transmission of communication signals, U.S. Patent No. 5,946,294. Washington, D C.: U.S. Patent and Trademark Office
Sterbenz, J.P.G., Hutchison, D., ¸Cetinkaya, E.K., Jabbar, A., Rohrer, J.P., Scholler, M., Smith, P. (2010). Resilience and survivability in communication networks: Strategies, principles, and survey of disciplines. Computer Networks 54(8), 1245-1265. DOI: 10.1016/j.comnet.2010.03.005
Yin, R. K. (2018). Case Sudy Research and Applications. Design and Methods (6th Ed.). Los Angeles CA: SAGE Publications.
How to Cite
APA
ACM
ACS
ABNT
Chicago
Harvard
IEEE
MLA
Turabian
Vancouver
Download Citation
License
Copyright (c) 2019 Aminadabe Barbosa de Sousa, Alberto Sampaio Lima, José Neuman de Souza, José Antão Beltrão Moura

This work is licensed under a Creative Commons Attribution 4.0 International License.
The authors or holders of the copyright for each article hereby confer exclusive, limited and free authorization on the Universidad Nacional de Colombia's journal Ingeniería e Investigación concerning the aforementioned article which, once it has been evaluated and approved, will be submitted for publication, in line with the following items:
1. The version which has been corrected according to the evaluators' suggestions will be remitted and it will be made clear whether the aforementioned article is an unedited document regarding which the rights to be authorized are held and total responsibility will be assumed by the authors for the content of the work being submitted to Ingeniería e Investigación, the Universidad Nacional de Colombia and third-parties;
2. The authorization conferred on the journal will come into force from the date on which it is included in the respective volume and issue of Ingeniería e Investigación in the Open Journal Systems and on the journal's main page (https://revistas.unal.edu.co/index.php/ingeinv), as well as in different databases and indices in which the publication is indexed;
3. The authors authorize the Universidad Nacional de Colombia's journal Ingeniería e Investigación to publish the document in whatever required format (printed, digital, electronic or whatsoever known or yet to be discovered form) and authorize Ingeniería e Investigación to include the work in any indices and/or search engines deemed necessary for promoting its diffusion;
4. The authors accept that such authorization is given free of charge and they, therefore, waive any right to receive remuneration from the publication, distribution, public communication and any use whatsoever referred to in the terms of this authorization.










