Towards Community-Driven Generative AI

Rustem Dautov; Erik Johannes Husom; Sagar Sen; Hui Song

Towards Community-Driven Generative AI

Rustem Dautov, Erik Johannes Husom, Sagar Sen, Hui Song

DOI: http://dx.doi.org/10.15439/2023F5494

Citation: Position Papers of the 18th Conference on Computer Science and Intelligence Systems, M. Ganzha, L. Maciaszek, M. Paprzycki, D. Ślęzak (eds). ACSIS, Vol. 36, pages 43–50 (2023)

Full text

Abstract. While the emerging market of Generative Artificial Intelligence (AI) is increasingly dominated and controlled by the Tech Giants, there is also a growing interest in open-source AI code and models from smaller companies, research organisations and individual users. They often have valuable data that could be used for training, but their computing resources are limited, while data privacy concerns prevent them from sharing this data for public training. A possible solution to overcome these two issues is to utilise the crowd-souring principles and apply federated learning techniques to build a distributed privacy-preserving architecture for training Generative AI. This paper discusses how these two key enablers, together with some other emerging technologies, can be effectively combined to build a community-driven Generative AI ecosystem, allowing even small actors to participate in the training of Generative AI models by securely contributing their training data. The paper also discusses related non-technical issues, such as the role of the community and intellectual property rights, and outlines further research directions associated with AI moderation.

References

V. Talla, M. Hessar, B. Kellogg, A. Najafi, J. R. Smith, and S. Gollakota, “LoRa Backscatter: Enabling The Vision of Ubiquitous Connectivity,” Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies, vol. 1, no. 3, pp. 1–24, 2017, https://doi.org/10.1145/3130970.
M. R. Ebling, “Pervasive Computing and the Internet of Things,” IEEE Pervasive Computing, vol. 15, no. 1, pp. 2–4, 2016, https://doi.org/10.1109/MPRV.2016.7.
R. Dautov and S. Distefano, “Three-level hierarchical data fusion through the IoT, edge, and cloud computing,” in Proceedings of the 1st International Conference on Internet of Things and Machine Learning. ACM New York, NY, USA, 2017, pp. 1–5, https://doi.org/10.1145/3109761.3158388.
R. Dautov, S. Distefano, D. Bruneo, F. Longo, G. Merlino, and A. Puliafito, “Pushing intelligence to the edge with a stream processing architecture,” in 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData). IEEE, 2017, pp. 792–799, https: //doi.org/10.1109/iThings-GreenCom-CPSCom-SmartData.2017.121.
X. Wang, Y. Han, V. C. Leung, D. Niyato, X. Yan, and X. Chen, Edge AI: Convergence of edge computing and artificial intelligence. Springer, 2020, https://doi.org/10.1007/978-981-15-6186-3.
E. J. Husom, R. Dautov, A. Nedisan Videsjorden, F. Gonidis, S. Papatzelos, and N. Malamas, “Machine Learning for Fatigue Detection using Fitbit Fitness Trackers,” in Proceedings of the 10th International Conference on Sport Sciences Research and Technology Support - icSPORTS, INSTICC. SciTePress, 2022, pp. 41–52, https://doi.org/10.5220/0011527500003321.
R. Dautov, E. J. Husom, F. Gonidis, S. Papatzelos, and N. Malamas, “Bridging the Gap Between Java and Python in Mobile Software Development to Enable MLOps,” in 2022 18th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob). IEEE, 2022, pp. 363–368, https://doi.org/10.1109/WiMob55322.2022.9941679.
R. Dautov, S. Distefano, G. Merlino, D. Bruneo, F. Longo, and A. Puliafito, “Towards a Global Intelligent Surveillance System,” in Proceedings of the 11th International Conference on Distributed Smart Cameras. ACM New York, NY, USA, 2017, pp. 119–124, https://doi.org/10.1145/3131885.3131918.
R. Dautov, S. Distefano, D. Bruneo, F. Longo, G. Merlino, A. Puliafito, and R. Buyya, “Metropolitan intelligent surveillance systems for urban areas by harnessing IoT and edge computing paradigms,” Software: Practice and experience, vol. 48, no. 8, pp. 1475–1492, 2018, https://doi.org/10.1002/spe.2586.
J. Konečnỳ, H. B. McMahan, F. X. Yu, P. Richtárik, A. T. Suresh, and D. Bacon, “Federated learning: Strategies for improving communication efficiency,” in 29th Conference on Neural Information Processing Systems (NIPS2016), 2016, pp. 1–5, https://doi.org/10.48550/arXiv.1610.05492.
I. Hegedűs, G. Danner, and M. Jelasity, “Gossip Learning as a Decentralized Alternative to Federated Learning,” in Distributed Applications and Interoperable Systems (DAIS 2019), J. Pereira and L. Ricci, Eds. Springer, 2019, pp. 74–90, https://doi.org/10.1007/978-3-030-22496-7_5.
G. Li, Y. Hu, M. Zhang, L. Li, T. Chang, and Q. Yin, “FedGosp: A Novel Framework of Gossip Federated Learning for Data Hetero-geneity,” in 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE, 2022, pp. 840–845, https://doi.org/10.1109/SMC53654.2022.9945192.
S. Boyd, A. Ghosh, B. Prabhakar, and D. Shah, “Gossip algorithms: Design, analysis and applications,” in Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies., vol. 3. IEEE, 2005, pp. 1653–1664, https://doi.org/10.1109/INFCOM.2005.1498447.
D. Shah, “Gossip algorithms,” Foundations and Trends® in Networking, vol. 3, no. 1, pp. 1–125, 2009, https://dx.doi.org/10.1561/1300000014.
H. O. Ikediego, M. Ilkan, A. M. Abubakar, and F. V. Bekun, “Crowd-sourcing (who, why and what),” International Journal of Crowd Science, vol. 2, no. 1, pp. 27–41, 2018, https://doi.org/10.1108/IJCS-07-2017-0005.
L. Duan, T. Kubo, K. Sugiyama, J. Huang, T. Hasegawa, and J. Walrand, “Incentive mechanisms for smartphone collaboration in data acquisition and distributed computing,” in 2012 Proceedings IEEE INFOCOM. IEEE, 2012, pp. 1701–1709, https://doi.org/10.1109/INFCOM.2012.6195541.
I. Kremer, Y. Mansour, and M. Perry, “Implementing the “wisdom of the crowd”,” Journal of Political Economy, vol. 122, no. 5, pp. 988–1012, 2014, https://doi.org/10.1086/676597.
T. Gillespie, “Content moderation, ai, and the question of scale,” Big Data & Society, vol. 7, no. 2, p. 2053951720943234, 2020, https://doi.org/10.1177/2053951720943234.
D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky, and D. Werthimer, “SETI@home: an experiment in public-resource computing,” Commun. ACM, vol. 45, no. 11, pp. 56–61, 2002, https://doi.org/10.1145/581571.581573.
S. Luper, “Epistemic relativism,” Philosophical Issues, vol. 14, pp. 271–295, 2004, https://doi.org/10.1111/j.1533-6077.2004.00031.x.
A. Dorri, S. S. Kanhere, and R. Jurdak, “Multi-Agent Systems: A Survey,” IEEE Access, vol. 6, pp. 28 573–28 593, 2018, https: //doi.org/10.1109/ACCESS.2018.2831228.
A. Chakraborty and A. K. Kar, “Swarm Intelligence: A Review of Algorithms,” in Nature-inspired computing and optimization: Theory and applications, S. Patnaik, X.-S. Yang, and K. Nakamatsu, Eds. Springer, 2017, pp. 475–494, https://doi.org/10.1007/978-3-319-50920-4_19.
G. A. Mashour, P. Roelfsema, J.-P. Changeux, and S. Dehaene, “Conscious Processing and the Global Neuronal Workspace Hypothesis,” Neuron, vol. 105, no. 5, pp. 776–798, 2020, https://doi.org/10.1016/j.neuron.2020.01.026.
R. VanRullen and R. Kanai, “Deep learning and the global workspace theory,” Trends in Neurosciences, vol. 44, no. 9, pp. 692–704, 2021, https://doi.org/10.1016/j.tins.2021.04.005.
H. Zhang, T. Nakamura, T. Isohara, and K. Sakurai, “A Review on Machine Unlearning,” SN Computer Science, vol. 4, no. 4, p. 337, 2023, https://doi.org/10.1007/s42979-023-01767-4.
H. Xu, T. Zhu, L. Zhang, W. Zhou, and P. S. Yu, “Machine unlearning: A survey,” ACM Computing Surveys, 2023, https://doi.org/10.1145/3603620.
L. Wu, S. Guo, J. Wang, Z. Hong, J. Zhang, and Y. Ding, “Federated Unlearning: Guarantee the Right of Clients to Forget,” IEEE Network, vol. 36, no. 5, pp. 129–135, 2022, https://doi.org/10.1109/MNET.001.2200198.
X. Gao, X. Ma, J. Wang, Y. Sun, B. Li, S. Ji, P. Cheng, and J. Chen, “VeriFi: Towards Verifiable Federated Unlearning,” Computing Research Repository, 2022, https://doi.org/10.48550/arXiv.2205.12709.