Gentoo Archives: gentoo-soc

From: EBo <ebo@×××××××.com>
To: gentoo-soc@l.g.o
Subject: Re: [gentoo-soc] Weekly Report: Big Data Infrastructure and Maven Overlay in Week 5
Date: Sun, 05 Jul 2020 17:40:28
Message-Id: 948e781d116daa98177d0c778648f0d5@mail.swcp.com
In Reply to: Re: [gentoo-soc] Weekly Report: Big Data Infrastructure and Maven Overlay in Week 5 by Zhang Zongyu
1 Ahhh... thank you for your time and explanation. This all makes sense
2 now. I can see how having something that would pom.xml <==>
3 package.ebuild would be extremely handy. Best of luck to you you and
4 your team.
5
6 Thanks again,
7
8 EBo --
9
10 On Jul 5 2020 11:23 AM, Zhang Zongyu wrote:
11 > Hello,
12 >
13 > Here is the proposal of this project.[1]
14 > In a nutshell, the goal of this project is to intergrate Portage with
15 > Maven, or to translate Maven pom.xml into ebuild files.
16 > And the most related project should be java-ebuilder, which aims at
17 > processing pom.xml and forming an equivalent ebuild.[2]
18 > The deliverable of this project will be an overlay resolving all the
19 > dependencies of spark, which is widely used for big data processing,
20 > so my
21 > mentor named the project "big data infrastructure".
22 >
23 > Actually it is not related to big data processing or any "big data".
24 >
25 > I am sorry if it confused you, and I am glad that you are interested
26 > in
27 > this project.
28 >
29 > Regards,
30 > Zhang Zongyu
31 >
32 > [1] https://summerofcode.withgoogle.com/projects/#4994566568017920
33 > [2] https://github.com/6-6-6/java-ebuilder
34 >
35 > EBo <ebo@×××××××.com> 于 2020年7月6日周一 上午12:18写道:
36 >
37 >> Out of curiosity, can you point me to any project write up that was
38 >> probably part of the proposal? One of my last big-data projects
39 >> started
40 >> out with a 3.5 petabyte repository and filtered on down to cover 10
41 >> million contiguous kilometers. We did not use Maven, and I would be
42 >> curious to see where the project is going. Now that I know about
43 >> this I
44 >> will try to follow it a bit more.
45 >>
46 >> Best of success!
47 >>
48 >> On Jul 5 2020 10:05 AM, Zhang Zongyu wrote:
49 >> > Hello,
50 >> >
51 >> > This week I have done a lot of repetitive work. That is, to go
52 >> > through
53 >> > all of the ebuild files in Gentoo Repository and to get them the
54 >> > equivalent GroupId and ArtifactId in Maven Central. There are more
55 >> > than 500 packages in dev-java/ and java-virtual/ category, and
56 >> > sometimes one Gentoo package may correspond to more than one Maven
57 >> > artifact, so it took me a lot of time to complete them. And here
58 >> is
59 >> > my
60 >> > lookup up table which covers all packages in dev-java/ and maps
61 >> > "category/package:slot" into "groupId:artifactId".[1] As a result,
62 >> > spark-core can make use of more than 100 Gentoo native packages
63 >> now.
64 >> >
65 >> > Besides, I have also added java-ebuilder some features. It can
66 >> check
67 >> > the metadata of a package and get the correct SLOT, and it can
68 >> make
69 >> > ebuild unpack "Maven Central distributed source files" into proper
70 >> > ${JAVA_SRC_DIR} now.
71 >> >
72 >> > Next week I am going to test and add jaxb (javax.xml.bind:jaxb-api
73 >> > and
74 >> > javax.xml.bind:jaxb-impl) and some newer version packages which
75 >> are
76 >> > required by spark-core, and I hope the newer version ebuild files
77 >> > could be accepted by Gentoo-Java team. The prototype of the ebuild
78 >> > files that are just generated by java-ebuilder are listed in [2].
79 >> >
80 >> > Regards,
81 >> > Zhang Zongyu
82 >> >
83 >> > [1] lookup table containing all packages from dev-java/
84 >> >
85 >> >
86 >>
87 >> https://github.com/6-6-6/JEbuilder/blob/master/app-portage/java-ebuilder/files/scripts/LUT/LUT
88 >> > [2] a prototype overlay
89 >> > https://github.com/6-6-6/spark-overlay/tree/master/dev-java
90 >>
91 >>
92 >>